Rec'd PCT/PTO 19 JUL 2002



FORM PTO-1390 (Modified) U.S. DEPARTMENT OF COMMERCE PATENT AND TRADEMARK OFFICE
(REV 5-93)

ATTORNEY'S DOCKET NUMBER
078883-0146

TRANSMITTAL LETTER TO THE UNITED STATES
DESIGNATED/ELECTED OFFICE (DO/EO/US) 
CONCERNING A FILING UNDER 35 U.S.C. 371 



INTERNATIONAL APPLICATION NO. 
PCT/GB00/03837



TITLE OF INVENTION 

PRODUCER CELL FOR THE PRODUCTION OF RETROVIRAL VECTORS 



APPLICANT(S) FOR DO/EO/US 

Jason SLINGSBY, Susan Mary KINGSMAN, Jonathan ROHLL and Andrew SLADE

Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information:



INTERNATIONAL FILING DATE 

10/05/2000 



U.S. APPLICATION NO. (if known, see 37 CFR 1.5)

10/088,076 



PRIORITY DATE CLAIMED 
10/05/1999 






1. □ This is a FIRST submission of items concerning a filing under 35 U.S.C. 371.

2. [x] This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371.

3. □ This express request to begin national examination procedures (35 U.S.C. 371(f)) at any time rather than delay
examination until the expiration of the applicable time limit set in 35 U.S.C. 371(b) and PCT Articles 22 and 39(1).

4. □ A proper Demand for International Preliminary Examination was made by the 19th month from the earliest claimed
priority date.

5. □ A copy of the International Application as filed (35 U.S.C. 371(c)(2))

□ is transmitted herewith (required only if not transmitted by the International Bureau).

□ has been transmitted by the International Bureau.

□ is not required, as the application was filed in the United States Receiving Office (RO/US)

6. □ A translation of the International Application into English (35 U.S.C. 371(c)(2)).

7. □ Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371(c)(3))

□ are transmitted herewith (required only if not transmitted by the International Bureau).

□ have been transmitted by the International Bureau.

□ have not been made; however, the time limit for making such amendments has NOT expired.

□ have not been made and will not be made.

8. □ A translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)).

9. [x] An oath or declaration of the inventor(s) (35 U.S.C. 371(c)(4)).

10. □ A translation of the annexes to the International Preliminary Examination Report under PCT Article 36 (35 U.S.C.
371(c)(5)).

11. □ Applicant claims small entity status under 37 CFR 1.27



Items 12. to 17. below concern other document(s) or information included: 

12. □ An Information Disclosure Statement under 37 CFR 1.97 and 1.98.

13. [x] An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is included.

14. □ A FIRST preliminary amendment.
A SECOND or SUBSEQUENT preliminary amendment.

15. □ A substitute specification.

16. □ A change of power of attorney and/or address letter.

17. [x] Other items or information: Preliminary Amendment; Statement to Support Filing and Submission; Sequence Listing
(54 pgs.); and Computer Readable format
U.S. APPLICATION NO. (if known, see 37 CFR 1.5)

10/088,076 


INTERNATIONAL APPLICATION NO 

PCT/GB00/03837 


ATTORNEY'S DOCKET NUMBER 

078883-0146 


18. [x] The following fees are submitted:


CALCULATIONS 


PTO USE ONLY 


Basic National Fee (37 CFR 1.492(a)(1)-(5)):

Search Report has been prepared by the EPO or JPO $890.00

International preliminary examination fee paid to USPTO
(37 CFR 1.482) $710.00

No international preliminary examination fee paid to USPTO (37 CFR 1.482)
but international search fee paid to USPTO (37 CFR 1.445(a)(2)) $740.00

Neither international preliminary examination fee (37 CFR 1.482) nor
International search fee (37 CFR 1.445(a)(2)) paid to USPTO $1,040.00

International preliminary examination fee paid to USPTO (37 CFR 1.482)
and all claims satisfied provisions of PCT Article 33(2)-(4) $100.00

ENTER APPROPRIATE BASIC FEE AMOUNT = 






Surcharge of $130.00 for furnishing the oath or declaration later than 20
months from the earliest claimed priority date (37 CFR 1.492(e))


$130.00 




Claims               Number Filed   Included in Basic Fee   Extra Claims   Rate
Total Claims                        20                      0              x $18.00   $0.00
Independent Claims                  3                       0              x $84.00   $0.00

Multiple dependent claim(s) (if applicable)   $280.00






TOTAL OF ABOVE CALCULATIONS =


$130.00 




Reduction by 1/2 for filing by small entity, if applicable.


$0.00 




SUBTOTAL = 


$130.00 




Processing fee of $130.00 for furnishing English translation later than 20
months from the earliest claimed priority date (37 CFR 1.492(f)). +






TOTAL NATIONAL FEE = 


$130.00 




Fee for recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.31). $40.00 per property + 








TOTAL FEES ENCLOSED = 


$130.00 










Amount to be refunded: $

Amount to be charged: $






a. [x] A check in the amount of $130.00 to cover the above fees is enclosed.

b. □ Please charge my Deposit Account No. 19-0741 in the amount of $130.00 to the above fees. A duplicate copy of this sheet
is enclosed.

c. The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any
overpayment to Deposit Account No. 19-0741. A duplicate copy of this sheet is enclosed.

NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR
1.137(a) or (b)) must be filed and granted to restore the application to pending status.



SEND ALL CORRESPONDENCE TO: 

Foley & Lardner 

Customer Number: 22428 



SIGNATURE 



22428 



NAME MICHELE M. SIMKIN



REGISTRATION NUMBER 34,717



PATENT TRADEMARK OFFICE 
Rec'd PCT/PTO 20 MAR 2002



FORM PTO-1390 (Modified) U.S. DEPARTMENT OF COMMERCE PATENT AND TRADEMARK OFFICE

(REV 5-93) 

TRANSMITTAL LETTER TO THE UNITED STATES 
DESIGNATED/ELECTED OFFICE (DO/EO/US) 
CONCERNING A FILING UNDER 35 U.S.C. 371 



ATTORNEY'S DOCKET NUMBER
078883-0146

U.S. APPLICATION NO.
Unassigned (10/088,076)




INTERNATIONAL APPLICATION NO. 
PCT/GB00/03837 



INTERNATIONAL FILING DATE 
5 October 2000 



PRIORITY DATE CLAIMED 
5 October 1999 



TITLE OF INVENTION

PRODUCER CELL FOR THE PRODUCTION OF RETROVIRAL VECTORS 



APPLICANT(S) FOR DO/EO/US 

Jason SLINGSBY, Susan Mary KINGSMAN, Jonathan ROHLL and Andrew SLADE

Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 

1. [x] This is a FIRST submission of items concerning a filing under 35 U.S.C. 371.

2. □ This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371.

3. □ This express request to begin national examination procedures (35 U.S.C. 371(f)) at any time rather than delay
examination until the expiration of the applicable time limit set in 35 U.S.C. 371(b) and PCT Articles 22 and 39(1).

4. [x] A proper Demand for International Preliminary Examination was made by the 19th month from the earliest claimed
priority date.

5. [x] A copy of the International Application as filed (35 U.S.C. 371(c)(2))

□ is transmitted herewith (required only if not transmitted by the International Bureau).
[x] has been transmitted by the International Bureau.
□ is not required, as the application was filed in the United States Receiving Office (RO/US)

6. □ A translation of the International Application into English (35 U.S.C. 371(c)(2)).

7. [x] Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371(c)(3))

□ are transmitted herewith (required only if not transmitted by the International Bureau).
□ have been transmitted by the International Bureau.
□ have not been made; however, the time limit for making such amendments has NOT expired.
[x] have not been made and will not be made.

8. □ A translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)).

9. □ An oath or declaration of the inventor(s) (35 U.S.C. 371(c)(4)).

10. □ A translation of the annexes to the International Preliminary Examination Report under PCT Article 36 (35 U.S.C.
371(c)(5)).

11. □ Applicant claims small entity status under 37 CFR 1.27.

Items 12. to 17. below concern other document(s) or information included:

12. [x] An Information Disclosure Statement under 37 CFR 1.97 and 1.98.

13. □ An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is included.

14. [x] A FIRST preliminary amendment.
□ A SECOND or SUBSEQUENT preliminary amendment.

15. □ A substitute specification.

16. □ A change of power of attorney and/or address letter.

17. [x] Other items or information: Application Data Sheet (4 pages)
JC13 Rec'd PCT/PTO 20 MAR 2002



ATTORNEY'S DOCKET NUMBER 

078883-0146 



U.S. APPLICATION 

Unassigned 



INTERNATIONAL APPLICATION NO 

PCT/GB00/03837 



18. [x] The following fees are submitted:



CALCULATIONS PTO USE ONLY



Basic National Fee (37 CFR 1.492(a)(1)-(5)):

Search Report has been prepared by the EPO or JPO $890.00

International preliminary examination fee paid to USPTO
(37 CFR 1.482) $710.00

No international preliminary examination fee paid to USPTO (37 CFR 1.482)
but international search fee paid to USPTO (37 CFR 1.445(a)(2)) $740.00

Neither international preliminary examination fee (37 CFR 1.482) nor
International search fee (37 CFR 1.445(a)(2)) paid to USPTO $1,040.00

International preliminary examination fee paid to USPTO (37 CFR 1.482)
and all claims satisfied provisions of PCT Article 33(2)-(4) $100.00



ENTER APPROPRIATE BASIC FEE AMOUNT = 



$890.00 



Surcharge of $130.00 for furnishing the oath or declaration later than 20
months from the earliest claimed priority date (37 CFR 1.492(e))



Claims               Number Filed   Included in Basic Fee   Extra Claims   Rate
Total Claims         48             20                      28             x $18.00   $504.00
Independent Claims                                                         x $84.00   $84.00

Multiple dependent claim(s) (if applicable)   $280.00



TOTAL OF ABOVE CALCULATIONS = 



$1478.00 
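
The total above can be checked from the figures entered on the form. The following is a minimal sketch of that arithmetic, assuming the basic fee, extra-claim rate and independent-claim amount shown on this sheet; the constant and function names are hypothetical and are not part of the form.

```python
from decimal import Decimal

# Amounts as printed on this sheet; all names here are illustrative only.
BASIC_FEE = Decimal("890.00")          # search report prepared by the EPO or JPO
EXTRA_CLAIM_RATE = Decimal("18.00")    # per claim beyond those included
CLAIMS_INCLUDED_IN_BASIC_FEE = 20


def total_of_calculations(total_claims_filed, independent_claims_fee,
                          multiple_dependent_fee=Decimal("0.00")):
    """Sum the basic fee, extra-claim fees and any other claim fees."""
    extra_claims = max(total_claims_filed - CLAIMS_INCLUDED_IN_BASIC_FEE, 0)
    return (BASIC_FEE
            + extra_claims * EXTRA_CLAIM_RATE
            + independent_claims_fee
            + multiple_dependent_fee)


# 48 claims filed, $84.00 entered for independent claims, no multiple-dependent fee.
total = total_of_calculations(48, Decimal("84.00"))
print(total)
```

With 48 claims filed, 28 claims exceed the 20 included in the basic fee, which is where the $504.00 entry comes from.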



Reduction by 1/2 for filing by small entity, if applicable.



$0.00 



SUBTOTAL = 



$1478.00 



Processing fee of $130.00 for furnishing English translation later than 20
months from the earliest claimed priority date (37 CFR 1.492(f)).



TOTAL NATIONAL FEE = 



$1478.00 



Fee for recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.31). $40.00 per property + 



TOTAL FEES ENCLOSED = 



$1478.00 



Amount to be refunded: $

Amount to be charged: $



a. [x] A check in the amount of $1478.00 to cover the above fees is enclosed.

b. □ Please charge my Deposit Account No. 19-0741 in the amount of $0.00 to the above fees. A duplicate copy of this sheet is
enclosed.

c. [x] The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any
overpayment to Deposit Account No. 19-0741. A duplicate copy of this sheet is enclosed.

NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR
1.137(a) or (b)) must be filed and granted to restore the application to pending status.






SEND ALL CORRESPONDENCE TO: 

Foley & Lardner 
Customer Number: 22428 






SIGNATURE 



NAME BERNHARD D. SAXE 




22428 



PATENT TRADEMARK OFFICE 



REGISTRATION NUMBER 28,665 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE

Atty. Docket No: 078883-0146 

In re patent application of
SLINGSBY, JASON et al.
Serial No. 10/088,076 
Filed: March 20, 2002 

For: PRODUCER CELL FOR THE PRODUCTION OF RETROVIRAL VECTORS 

STATEMENT TO SUPPORT FILING AND SUBMISSION IN 
ACCORDANCE WITH 37 C.F.R. §§ 1.821-1.825 



Assistant Commissioner for Patents 
Washington, D.C. 20231 
Box SEQUENCE 

Sir:

In connection with a Sequence Listing submitted concurrently 
herewith, the undersigned hereby states that: 

1. the submission, filed herewith in accordance with 37 
C.F.R. § 1.821(g), does not include new matter; 

2. the content of the attached paper copy and the
attached computer readable copy of the Sequence Listing, submitted in
accordance with 37 C.F.R. § 1.821(c) and (e), respectively, are the same;
and 

3. all statements made herein of their own knowledge are 
true and that all statements made on information and belief are believed to 
be true; and further, that these statements were made with the knowledge 
that willful false statements and the like so made are punishable by fine 
or imprisonment, or both, under Section 1001 of Title 18 of the United 
States Code and that such willful false statements may jeopardize the 
validity of the application or any patent resulting therefrom. 



HARBOR CONSULTING 

Intellectual Property Services 
1500A Lafayette Road 
Suite 262 
Portsmouth, N.H. 
800-318-3021 



Respectfully submitted, 




PATENT

ATTORNEY DOCKET NO.: 078883/0146 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re Application of: 
Slingsby et al. 
Appln.No.: 10/088,076 
Filed: March 20, 2002 



Group Art Unit: Unknown 
Examiner: Unknown 



For: PRODUCER CELL FOR THE PRODUCTION OF RETROVIRAL VECTORS 

Commissioner for Patents 
Washington, D.C. 20231

Sir: 

PRELIMINARY AMENDMENT 

In response to the Notice to Comply with Requirements for Applications Containing 
Sequence Disclosures, mailed on May 20, 2002, please amend this application as follows. 

IN THE SPECIFICATION: 

Please replace the following paragraph with the amended version that follows. In
accordance with 37 C.F.R. § 1.121(c)(2), a marked-up version of the specification is set forth
at the end of this response under the heading "Marked-up Version of the Specification."

Please delete the paragraph at page 19, lines 17-21, and replace it with the following 
paragraph: 

Figure 16 shows the alignment of leader and gag regions present in vectors pONY4Z (SEQ ID
NO: 63), 8Z (SEQ ID NO: 64) and ATG mutated 8Z (SEQ ID NO: 65) vector. The latter is
referred to as pONY8ZA. The sequences aligned are from the NarI site in the leader to the Xba
site between the EIAV gag sequence and the CMV promoter. Sequences in the leader are
shown in italic and a space is present upstream of the position of the gag ATG; and



REMARKS 



Applicants respectfully request formal examination of this application. 

Applicants have amended the specification to identify amino acid sequences 
according to their SEQ ID NOs. As the amendments do not introduce new matter, entry 
thereof by the Examiner is respectfully requested. 

Favorable reconsideration of the application as amended is respectfully requested. 
The Examiner is invited to contact the undersigned by telephone if it is felt that a telephone 
interview would advance the prosecution of the present application. 

If there are any fees due in connection with the filing of this Preliminary Amendment, 
please charge the fees to our Deposit Account No. 19-0741. If a fee is required for an 
extension of time under 37 C.F.R. § 1.136 not accounted for above, such an extension is 
requested and the fee should also be charged to our Deposit Account. 



Respectfully submitted, 



Date July 19, 2002 
FOLEY & LARDNER 
Washington Harbour 




3000 K Street, N.W., Suite 500 
Washington, D.C. 20007-5109 
Telephone: (202) 672-5538 
Facsimile: (202) 672-5399 



Michele M. Simkin 
Attorney for Applicant 
Registration No. 34,717 

MARKED UP VERSION OF THE SPECIFICATION 

Please delete the paragraph at page 19, lines 17-21, and replace it with the following 
paragraph: 

Figure 16 shows the alignment of leader and gag regions present in vectors pONY4Z (SEQ ID
NO: 63), 8Z (SEQ ID NO: 64) and ATG mutated 8Z (SEQ ID NO: 65) vector. The latter is
referred to as pONY8ZA. The sequences aligned are from the NarI site in the leader to the Xba
site between the EIAV gag sequence and the CMV promoter. Sequences in the leader are
shown in italic and a space is present upstream of the position of the gag ATG; and
Atty. Dkt. No. 078883-0146



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicants: Jason SLINGSBY et al. 

Title: PRODUCER CELL FOR THE 

PRODUCTION OF RETROVIRAL 
VECTORS 

Appl. No.: Unassigned 

Filing Date: 03/20/2002 

Examiner: Unassigned 

Art Unit: Unassigned 






PRELIMINARY AMENDMENT 

Commissioner for Patents 
Box PCT 

Washington, D.C. 20231 

Sir:

Prior to examination of the above-identified application, Applicants respectfully 
request that the above-identified application be amended as follows: 

In the Claims: 

Please cancel claims 42 and 50. 

In accordance with 37 C.F.R. § 1.121, please substitute for claims 3, 4, 6 - 8, 10
- 13, 15, 16, 22, 24, 26, 27, 30, 34, 38, 40, 41, 43, and 47 - 49, the following 
rewritten version of the same claims. The changes are shown explicitly in the attached 
"Marked-up Version of Amended Claims." 

3. (Once Amended) A method according to claim 1, wherein the 
construct further comprises the 5'LTR and/or the packaging signal. 
4. (Once Amended) A method according to claim 1, wherein the
construct LTR is a heterologous regulatable LTR.

6. (Once Amended) A method according to claim 1, wherein the
construct LTR is inactive.

7. (Once Amended) A method according to claim 1, wherein the
provirus comprises an NOI encoding a selectable marker, which NOI is flanked by
recombinase recognition sites.

8. (Once Amended) A method according to claim 1, wherein the
provirus comprises an internal 5' LTR upstream of the recombinase site or the 5'
recombinase site where there is more than one site.

10. (Once Amended) A method according to claim 1, wherein the U3
region of the 5' LTR and/or the U3 region of the second internal 5' LTR comprises a
heterologous promoter.

11. (Once Amended) A method according to claim 1, wherein the
provirus comprises two recombinase recognition sites and as a preliminary step, the
recombinase is expressed in a host cell such that the nucleotide sequence present
between the two sites is excised.

12. (Once Amended) A method according to claim 1, wherein the
producer cell is a high titre producer cell, capable of producing at least 10^6 retrovirus
particles per ml.

13. (Once Amended) A method according to claim 1, wherein the
provirus is a lentivirus.

15. (Once Amended) A method according to claim 2, wherein the
provirus further comprises a second NOI.
16. (Once Amended) A producer cell obtainable by the method of claim 1.

22. (Once Amended) A producer cell according to claim 18, wherein
the third LTR is transcriptionally quiescent.

24. (Once Amended) A producer cell according to claim 20, wherein
the first NOI is a selectable marker.

26. (Once Amended) A producer cell according to claim 25, wherein
the second LTR comprises a deletion in the U3 sequences in the 3' LTR.

27. (Once Amended) A producer cell according to claim 25, wherein
the second NOI comprises a coding sequence operably linked to a promoter.

30. (Once Amended) A method for producing a high titre regulatable 
retroviral vector, the method comprising: 

(i) providing a derived producer cell comprising integrated into its 
genome a first vector; 

(ii) introducing a second vector into the derived producer cell using 
a recombinase assisted method; 

wherein the derived producer cell comprises a retroviral vector comprising in the 5' to 3' 
direction a first LTR (5' LTR); a second NOI operably linked to a second LTR (regulatable 
3' LTR); and a third LTR (3' LTR); wherein the third LTR is positioned downstream of 
the second LTR in the derived producer cell. 

34. (Once Amended) A process for preparing a regulated retroviral 
vector, comprising performing the method according to claim 30 and preparing a 
quantity of the regulated retroviral vector. 
38. (Once Amended) A regulated retroviral vector according to claim
36, wherein the target site is a cell. 

40. (Once Amended) A regulated retroviral vector according to claim 
35, in combination with a pharmaceutically acceptable carrier. 

41. (Once Amended) A medicament for diagnostic and/or therapeutic
and/or medical applications, comprising a regulated retroviral vector according to claim 
35. 

43. (Once Amended) A derived stable producer cell capable of 
expressing regulated retroviral vectors according to claim 35. 

47. (Once Amended) A nucleic acid vector according to claim 45, 
further comprising a 5' LTR and/or a packaging signal. 

48. (Once Amended) A nucleic acid vector according to claim 45, 
wherein the LTR is a heterologous regulatable LTR. 

49. (Once Amended) A nucleic acid vector according to claim 45, 
wherein the LTR is transcriptionally quiescent.



REMARKS

Applicants respectfully request that the foregoing amendments be made prior to 
examination of the present application. The amendments are made to correct multiple 
dependencies and do not change the scope of the invention. 



Date March 20, 2002 
FOLEY & LARDNER 
Customer Number: 22428 



22428 



PATENT TRADEMARK OFFICE 

Telephone: (202) 672-5427 
Facsimile: (202) 672-5399 



Respectfully submitted, 



By 




Bernhard D. Saxe 
Attorney for Applicants 
Registration No. 28,665 
MARKED UP VERSION OF AMENDED CLAIMS

3. (Once Amended) A method according to claim 1, [or claim 2]
wherein the construct further comprises the 5'LTR and/or the packaging signal.

4. (Once Amended) A method according to [any one of claims] claim
1 [to 3], wherein the construct LTR is a heterologous regulatable LTR.

6. (Once Amended) A method according to [any one of claims] claim
1, [to 3] wherein the construct LTR is inactive.

7. (Once Amended) A method according to [any one of the preceding
claims] claim 1, wherein the provirus comprises an NOI encoding a selectable marker,
which NOI is flanked by recombinase recognition sites.

8. (Once Amended) A method according to [any one of the
preceding claims] claim 1, wherein the provirus comprises an internal 5' LTR upstream
of the recombinase site or the 5' recombinase site where there is more than one site.

10. (Once Amended) A method according to [any one of the preceding
claims] claim 1, wherein the U3 region of the 5' LTR and/or the U3 region of the second
internal 5' LTR comprises a heterologous promoter.

11. (Once Amended) A method according to [any one of the preceding
claims] claim 1, wherein the provirus comprises two recombinase recognition sites and
as a preliminary step, the recombinase is expressed in a host cell such that the
nucleotide sequence present between the two sites is excised.

12. (Once Amended) A method according to [any one of the preceding
claims] claim 1, wherein the producer cell is a high titre producer cell, capable of
producing at least 10^6 retrovirus particles per ml.
13. (Once Amended) A method according to [any one of the preceding
claims] claim 1, wherein the provirus is a lentivirus.

15. (Once Amended) A method according to [any one of claims] claim
2 [- 14], wherein the provirus further comprises a second NOI.

16. (Once Amended) A producer cell obtainable by the method of [any
one of claims] claim 1 [to 15].

22. (Once Amended) A producer cell according to [any one of claims]
claim 18, [to 21] wherein the third LTR is transcriptionally quiescent.

24. (Once Amended) A producer cell according to [any one of claims]
claim 20, [to 23] wherein the first NOI is a selectable marker.

26. (Once Amended) A producer cell according to claim 25, wherein
the second LTR comprises a deletion in the U3 sequences in the 3' LTR.

27. (Once Amended) A producer cell according to claim 25 [or claim
26], wherein the second NOI comprises a coding sequence operably linked to a
promoter.

30. (Once Amended) A method for producing a high titre regulatable 
retroviral vector, the method comprising [the steps of]: 

(i) providing a derived producer cell comprising integrated into its 
genome a first vector; 

(ii) introducing a second vector into the derived producer cell using 
a recombinase assisted method; 

wherein the derived producer cell comprises a retroviral vector comprising in the 5' to 3' 
direction a first LTR (5' LTR); a second NOI operably linked to a second LTR (regulatable 
3' LTR); and a third LTR (3' LTR); wherein the third LTR is positioned downstream of
the second LTR in the derived producer cell. 

34. (Once Amended) A process for preparing a regulated retroviral 
vector, [as defined in claim 17] comprising performing the method according to [any 
one of claims] claim 30 [to 33] and preparing a quantity of the regulated retroviral 
vector. 

38. (Once Amended) A regulated retroviral vector according to claim 
36, [or claim 37] wherein the target site is a cell. 

40. (Once Amended) [Use of a] A regulated retroviral vector according
to [any one of claims] claim 35, [to 38] in [the manufacture of a pharmaceutical
composition to deliver an NOI to a target site] combination with a pharmaceutically
acceptable carrier.

41. (Once Amended) [Use of a regulated retroviral vector according to
any one of claims 35 to 38 in the manufacture of a] A medicament for diagnostic
and/or therapeutic and/or medical applications, comprising a regulated retroviral vector
according to claim 35.

43. (Once Amended) A derived stable producer cell capable of
expressing regulated retroviral vectors according to claim[s] 35 [to 38].

47. (Once Amended) A nucleic acid vector according to claim 45, [or 
claim 46] further comprising a 5' LTR and/or a packaging signal. 

48. (Once Amended) A nucleic acid vector according to [any one of 
claims] claim 45, [to 47] wherein the LTR is a heterologous regulatable LTR. 

49. (Once Amended) A nucleic acid vector according to [any one of 
claims] claim 45, [to 47] wherein the LTR is transcriptionally quiescent.
PRODUCER CELL FOR THE PRODUCTION OF RETROVIRAL VECTORS



FIELD OF THE INVENTION 


The present invention relates to retroviral vectors, in particular to high titre regulatable 
retroviral vectors. 

BACKGROUND TO THE INVENTION 


Retroviruses have been proposed as a delivery system (otherwise expressed as a delivery
vehicle or delivery vector) for inter alia the transfer of a nucleotide sequence of interest
(NOI), or a plurality of NOIs, to one or more sites of interest. The transfer can occur in
vitro, ex vivo, in vivo, or combinations thereof. When used in this fashion, the retroviruses
are typically called retroviral vectors or recombinant retroviral vectors. Retroviral vectors
have been exploited to study various aspects of the retrovirus life cycle, including receptor
usage, reverse transcription and RNA packaging (reviewed by Miller, 1992 Curr Top
Microbiol Immunol 158:1-24).

In a typical recombinant retroviral vector for use in gene therapy, at least part of one or
more of the gag, pol and env protein coding regions may be removed from the virus. This
makes the retroviral vector replication-defective. The removed portions may even be
replaced by a NOI in order to generate a virus capable of integrating its genome into a
host genome but wherein the modified viral genome is unable to propagate itself due to a
lack of structural proteins. When integrated in the host genome, expression of the NOI
occurs - resulting in, for example, a therapeutic effect. Thus, the transfer of a NOI into a
site of interest is typically achieved by: integrating the NOI into the recombinant viral
vector; packaging the modified viral vector into a virion coat; and allowing transduction of a
site of interest - such as a targeted cell or a targeted cell population.


It is possible to propagate and isolate quantities of retroviral vectors (e.g. to prepare suitable 
titres of the retroviral vector) for subsequent transduction of, for example, a site of interest 
by using a combination of a packaging or helper cell line and a recombinant vector. 
In some instances, propagation and isolation may entail isolation of the retroviral gag, pol
and env genes and their separate introduction into a host cell to produce a "packaging cell
line". The packaging cell line produces the proteins required for packaging retroviral
RNA but it does not produce RNA-containing retroviral vectors. However, when a
recombinant vector carrying a NOI and a psi region is introduced into the packaging cell
line, the helper proteins can package the psi-positive recombinant vector to produce the
recombinant virus stock. This can be used to infect cells to introduce the NOI into the
genome of the cells. The recombinant virus whose genome lacks all genes required to
make viral proteins can infect only once and cannot propagate. Hence, the NOI is
introduced into the host cell genome without the generation of potentially harmful
retrovirus. A summary of the available packaging lines is presented in "Retroviruses"
(1997 Cold Spring Harbor Laboratory Press Eds: JM Coffin, SH Hughes, HE Varmus
pp 449).

Retroviral packaging cell lines have been developed to produce retroviral vectors. These
cell lines are designed to express three components, which may be located on three
separate expression constructs. The gag/pol expression construct encodes structural and
enzymatic components required in particle formation, maturation, reverse transcription
and integration. The envelope (env) construct expresses a retroviral or non-retroviral
envelope protein, which mediates viral entry into cells by binding to its cognate receptor.
The third expression construct produces the retroviral RNA genome containing a psi
region, which is packaged into mature, enveloped retroviral particles.
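
The three-component arrangement described above can be pictured with a toy model. This is only an illustrative sketch, assuming each construct is reduced to a label and a flag for the psi region; the class, field and function names are hypothetical and do not describe any actual construct or software.

```python
from dataclasses import dataclass

# Helper functions supplied by the packaging cell line, as named in the text.
HELPER_COMPONENTS = frozenset({"gag/pol", "env"})


@dataclass(frozen=True)
class ExpressionConstruct:
    component: str          # "gag/pol", "env" or "genome" (illustrative labels)
    has_psi: bool = False   # only meaningful for the genome construct


def produces_vector(constructs):
    """True if helper proteins and a psi-positive genome are all present."""
    present = {c.component for c in constructs}
    packagable_genome = any(
        c.component == "genome" and c.has_psi for c in constructs
    )
    return HELPER_COMPONENTS <= present and packagable_genome


packaging_line = [ExpressionConstruct("gag/pol"), ExpressionConstruct("env")]
producer_line = packaging_line + [ExpressionConstruct("genome", has_psi=True)]

print(produces_vector(packaging_line))  # helper proteins only, no genome
print(produces_vector(producer_line))   # helper proteins plus psi-positive genome
```

In this simplification a packaging line alone yields no RNA-containing particles, and adding a psi-positive genome construct turns it into a producer line.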

It has been observed that different methods, such as electroporation, transfection and
retroviral transduction, which have been used to introduce the retroviral expression
construct for the RNA genome, termed "the genome", into packaging cells produce
different results. These different results can include different end points or "yield" of
retroviral producer lines resulting from the derived cell lines. Moreover, electroporation
and transfection methods can be problematic in the sense that the titre levels are not always
at a satisfactory level.

By way of example, the transfection of a plasmid DNA construct into packaging cells
from a MLV packaging cell line of human origin, called FLYA13, yielded low retroviral
vector titres even when different transfection reagents such as calcium phosphate
precipitation and fugene transfection reagent were used. The average titres from selected
stably transfected cell line clones ranged from about 10^3 to about 10^4 per ml. In
addition, clones generated by electroporation of constructs gave similar titres of from
about 10^3 to about 10^4 per ml with no clones identified producing at >10* per ml.
However, when MLV vector particles are prepared in a transient transfection system with
a different envelope pseudotype to the packaging cell, and used to transduce a retroviral
packaging cell, stably transduced cell lines made by this transduction method produce
retrovirus at 10^6 to 10^7 per ml. Therefore, these results suggest that retroviral transduction
is a preferred method for genome introduction into packaging cell lines in order to
generate high titre producer cell lines. However, when retroviral transduction is used to
introduce a regulated/inactivated retroviral vector genome into packaging cell lines, the
regulated retroviral vectors may not be produced in sufficient quantities from these cell
lines.

By way of example, some retroviral vectors may comprise (i) internal expression 
constructs which are themselves regulated or (ii) regulated elements which are present in 
retroviral 3' LTR sequences, either by design or by their nature. Examples of these 
regulated vectors include but are not limited to hypoxic regulated vectors and
self-inactivating (SIN) vectors. If transduced producer cell lines are generated with these
regulated vectors, the regulated or inactivated 3' U3 sequence of the LTR is copied to the
5' LTR by the process of retroviral reverse transcription and integration. Therefore, in the
producer cell line, the 5' U3 promoter sequence directing expression of retroviral RNA
genomes is identical to the regulated or inactivated 3' U3 promoter. This will result in
very low levels of retroviral genome production and consequently low titres of functional
retrovirus vectors being produced.

One example of such a regulated retroviral system includes MLV and lentivirus vector 
constructs where the 3' retroviral U3 enhancer element is replaced with a hypoxia 
responsive element (HRE) or other physiologically regulated, tumour specific or
tissue-specific promoters. When these vectors are used to make a transduced producer cell line,
the 3' U3 sequence containing the HRE element is copied to the 5' LTR position and
retroviral genomes will only be produced under hypoxic conditions or in the presence of
chemical mimics of hypoxia, such as heavy metal ions and desferoxamine. Such a requirement
for "induction for retroviral production" is not preferable as the different hypoxia induction
protocols negatively affect retroviral producer cell viability.

By way of further example, lentivector U3 enhancers are dependent on the transactivator 
TAT for transcriptional activation. Therefore, a lentivector producer cell line generated 
by transduction requires the presence of TAT for high level expression of the lentivector 
genome construct. The expression of TAT is not preferable in such a packaging cell line 
and therefore, in the absence of TAT, only very low titres will be produced from
transduced producer cells generated in this way. 



Another example of a regulated retroviral system includes MLV or lentivirus
self-inactivating (SIN) vectors. These vectors contain deletions of the elements in their 3' U3
sequences responsible for transcriptional activity. Therefore, on transduction of target
cells, the transcriptionally inactive 3' U3 sequence is copied to the 5' LTR position. In
standard configurations, an internal expression cassette directs therapeutic or marker gene 
expression. However, if SIN vectors are used to make a transduced retroviral producer 
line, there will be no transcriptional elements present to direct high levels of retroviral 
RNA genome expression. 



Although it is possible to carry out retroviral transduction with much lower-titre vector 
stocks, for practical reasons, high-titre retrovirus is desirable, especially when a large 
number of cells must be infected. In addition, high titres are a requirement for 
transduction of a large percentage of certain cell types. For example, the frequency of
human hematopoietic progenitor cell infection is strongly dependent on vector titre, and
useful frequencies of infection occur only with very high-titre stocks (Hock and Miller
1986 Nature 320: 275-277; Hogge and Humphries 1987 Blood 69: 611-617). In these
cases, it is not sufficient simply to expose the cells to a larger volume of virus to
compensate for a low virus titre. On the contrary, in some cases, the concentration of
infectious vector virions may be critical to promote efficient transduction.

SUMMARY OF THE INVENTION

We have now shown that it is possible to obtain transduced producer cells capable of
producing high titre regulated retroviral vectors by replacing at least the 3'LTR of the
integrated provirus using a recombinase based system. Thus whereas with the prior art,
the U3 region of the 3'LTR is the same as that of the U3 region of the 5' LTR (and vice
versa for the U5 region) in the provirus due to the way in which the viral vector
integrates, the introduction of, for example, a replacement 3'LTR results in a provirus that
has a 5' LTR and a 3'LTR that differ. The packaged viral vectors produced by
transcription of the proviral genome within the producer cells may then ultimately be
used to transduce target cells where the regulatable sequences present in the 3'LTR of the
provirus in the producer cells are then present in the 5'LTR of the provirus in the target
cells and consequently regulate transcription from the provirus as required.

This allows the introduction of a 3'LTR, for example a regulatable 3'LTR, into the
provirus that was not desirable in the original viral vector used to transduce the producer
cells since the consequential appearance of the regulatable 3'LTR U3 sequences in the
5'LTR in the provirus may lead to a reduced viral titre.

Consequently, the present invention allows transduced producer cells to be constructed
that are capable of producing high titre regulated retroviral vectors by virtue of
comprising a 5'LTR that directs high level expression of the viral genome in the producer
cell and a different 3'LTR which, as a result of the process of integration into a target cell,
will then result in a provirus in the target cell genome that exhibits regulatable expression.

In particular, the present invention allows the modification of a provirus integrated into 
the genome of the producer cells that have been selected for their high titre virus 
production such that the resulting packaged viral particles produced from the provirus 
may be used to transduce target cells resulting in a provirus integrated into the genome of 
the target cells that has a different, and preferably regulatable, 5'LTR to that of the
producer cell provirus.

The present invention is not limited to replacement of the 3'LTR of the provirus in the
high titre producer cells, but may also include replacement of the 5'LTR and other viral 
sequences and/or the introduction of NOIs by the use of suitable constructs, as shown in 
the Figures. 

Accordingly, the present invention provides a method of modifying a producer cell which 
producer cell comprises integrated into its genome a provirus which provirus comprises 
one or more recombinase recognition sequences within or upstream of its 3' LTR, the 
method comprising: introducing into the cell a construct comprising a 5' recombinase 
recognition sequence, an LTR and a 3' recombinase recognition sequence in that order, in
the presence of a recombinase which is capable of acting on the recombinase recognition 
site(s) such that the nucleotide sequence between the 5' and 3' recombinase recognition 
sequences in the construct is introduced into the provirus. 
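
As an informal illustration only, the insertion step recited above can be modelled in a few lines of Python. The sketch treats the provirus and the construct as ordered lists of element labels. The label "loxP", the function name insert_cassette and the example element names are assumptions made for illustration and do not appear in the specification. It models only the simplest case, in which the site-flanked LTR is inserted at a single recognition sequence located upstream of the 3' LTR.

```python
from typing import List

# Hypothetical label standing for any recombinase recognition sequence.
SITE = "loxP"


def insert_cassette(provirus: List[str], construct: List[str]) -> List[str]:
    """Insert the site-flanked payload of the construct at the provirus site.

    Toy model: elements are plain labels and recombination is treated as a
    list insertion next to the first recognition sequence in the provirus.
    """
    if len(construct) < 3 or construct[0] != SITE or construct[-1] != SITE:
        raise ValueError("construct must be flanked by recognition sequences")
    if SITE not in provirus:
        raise ValueError("provirus carries no recognition sequence")
    payload = construct[1:-1]
    at = provirus.index(SITE)
    # The payload lands directly after the resident site and is followed by
    # a second site, so the inserted sequence ends up flanked by two sites.
    return provirus[: at + 1] + payload + [SITE] + provirus[at + 1:]


provirus = ["5'LTR", "psi", "NOI", SITE, "3'LTR"]
construct = [SITE, "regulatable LTR", SITE]
modified = insert_cassette(provirus, construct)
print(modified)
```

In this model the regulatable LTR ends up upstream of the original 3' LTR, so the modified provirus carries a 5' LTR and a new 3' LTR that differ.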

Preferably the LTR is a heterologous regulatable LTR.

The present invention further provides a nucleic acid vector comprising a 5' recombinase 
recognition sequence, a regulatable LTR and a 3' recombinase recognition sequence in 
that order. 

In any of the above aspects and embodiments of the invention, preferably the construct, 
nucleic acid molecule and/or nucleic acid vector further comprises at least one NOI 
between the 5' recombinase recognition sequence and the regulatable LTR. 

Preferably the construct, nucleic acid molecule and/or nucleic acid vector further
comprises a 5'LTR and/or a packaging signal.

In one embodiment of the invention, the LTR is inactive/transcriptionally quiescent. 

The construct, nucleic acid molecule and/or nucleic acid vector of the invention may be
used in a recombinase assisted method to introduce a regulated LTR into a proviral 
genome integrated into a producer cell genome. 

The present invention also provides a producer cell obtainable by the method of the
invention, preferably a high titre producer cell. Also provided is an infectious retroviral
particle obtained by the above method.

The present invention further provides a high titre producer cell comprising integrated
into its genome a provirus, which provirus comprises a recombinase recognition site, a 5' 
LTR and a 3'LTR which 3'LTR differs from the 5'LTR. Such a producer cell will 
typically have been produced by the method of the invention. 

Preferably the 5'LTR and the 3'LTR referred to for the purposes of comparison are both
"active". The term "active" within the present context means transcriptionally active, that
is to say, the 5'LTR comprises a promoter that directs transcription of the viral genome
and the 3'LTR comprises a transcriptional stop sequence to terminate transcription. This
distinction is relevant since if a provirus produced by the method of the invention
comprises more than one 5' LTR or 3'LTR, at least one but not all must be active to
allow viral production. Further, if the provirus comprises more than one 3'LTR then it is
generally the upstream one that will be active since transcription will tend not to read
through to the downstream 3' LTR.

In addition, where the method of the invention results in an insertion of a 3'LTR upstream
of the original 3'LTR, the comparison should be performed between the additional 3'LTR
and the original 5'LTR and not the two original LTRs. Thus it is permitted to have a
5'LTR and 3'LTR within the same provirus that are the same provided that there is also a
5'LTR and 3'LTR that differ.

In another aspect, the present invention provides a derived producer cell comprising 
integrated into its genome a retroviral vector comprising in the 5' to 3' direction a first 5' 
LTR; a second NOI operably linked to a second regulatable 3' LTR; and a third 3'LTR; 
wherein the third 3'LTR is positioned downstream of the second regulatable 3'LTR in the 
producer cell.

Preferably the first 5' LTR comprising 5'R and 5' U5 sequences is derivable from a first
vector; the second NOI operably linked to a second regulatable 3' LTR is derivable from
a second vector; and the third 3'LTR is derivable from the first vector.

In a preferred embodiment, the first vector further comprises an internal LTR located
upstream of the first NOI and downstream of the packaging signal wherein the internal
LTR comprises a heterologous U3 sequence linked to heterologous R and U5 sequences.

Preferably the heterologous R and U5 sequences are lentiviral derivable R and U5
sequences, such as EIAV R and U5 sequences.

In a further preferred embodiment, the third 3'LTR is transcriptionally active but
expression is directed away from the second regulatable 3'LTR.

In another embodiment, the second vector comprises a second NOI operably linked to a
second regulatable 3' LTR comprising at least one recombinase recognition sequence. 
Preferably the second regulatable 3' LTR comprises a deletion in the U3 sequences in the 
3' LTR. 

Preferably, the second NOI comprises a dicistronic construct, more preferably a
dicistronic construct comprising a therapeutic gene, an internal ribosomal entry site
(IRES) and a reporter gene.

The present invention further provides, in another embodiment, a method for producing a
high titre regulatable retroviral vector, the method comprising the steps of:

(i) providing a derived producer cell comprising integrated into its genome a first vector;

(ii) introducing a second vector into the derived producer cell using a recombinase
assisted method; wherein the derived producer cell comprises a retroviral vector
comprising in the 5' to 3' direction a first 5' LTR; a second NOI operably linked to a
second regulatable 3' LTR; and a third 3'LTR; wherein the third 3'LTR is positioned
downstream of the second regulatable 3' LTR in the derived producer cell.

The present invention also provides the use of a recombinase assisted mechanism to
introduce a regulated 3'LTR into a derived producer cell line to produce a high titre 
regulated retroviral vector. 

Aspects of the present invention are also presented in the accompanying claims and in the
following description and discussion. 

These aspects are presented under separate section headings. However, it is to be 
understood that the teachings under each section heading are not necessarily limited to 
that particular section heading.

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is advantageous because: 

(i) it enables regulated retroviral vectors to be produced at high titres from transduced
producer cell lines.

(ii) it removes the uncertainty associated with the process of producer cell line derivation
and the necessity to screen large numbers of producer cell lines each time a new retroviral
expression construct is introduced into a producer cell line.

(iii) it greatly facilitates the generation of high titre retroviral stocks without the use of
marker genes (such as but not limited to β-galactosidase, green fluorescent protein) and
antibiotic resistance genes.

(iv) it avoids the derivation of low titre transfected producer cell lines or the use of
hypoxic conditions or chemical mimics for production from traditionally derived
transduced producer lines.

(v) it enables the production of SIN vectors by stable cell line producer technology.
Previously, SIN vectors have not been amenable to production by stable cell line producer
technology because the deletion of the 3'U3 sequence resulted in at least a tenfold lower
titre of self-inactivating (SIN) vectors in comparison with vectors having intact LTRs.
Consequently, SIN vectors have had to be prepared using transfection-based transient
expression systems.

PRODUCER CELL

The high titre regulated retroviral vector particles of the present invention are typically 
generated in a suitable producer cell. Producer cells are generally mammalian cells but 
can be, for example, insect cells. A producer cell may be a packaging cell containing the
virus structural genes, normally integrated into its genome, into which the regulated
retroviral vectors of the present invention are introduced. Alternatively the producer cell
may be transfected with nucleic acid sequences encoding structural components, such as
gag/pol/env on one or more vectors such as plasmids, adenovirus vectors, herpes viral
vectors or any method known to deliver functional DNA into target cells. The vectors
according to the present invention are then introduced into the packaging cell by the
methods of the present invention.

As used herein, the term "producer cell" or "vector producing cell" refers to a cell which 
contains all the elements necessary for production of regulated retroviral vector particles 
and regulated retroviral delivery systems.

Preferably, the producer cell is obtainable from a stable producer cell line. 

Preferably, the producer cell is obtainable from a derived stable producer cell line. 

Preferably, the producer cell is obtainable from a derived producer cell line.

As used herein, the term "derived producer cell line" is a transduced producer cell line
which has been screened and selected for high expression of a marker gene. Such cell
lines contain retroviral insertions in integration sites that support high level expression
from the retroviral genome. The term "derived producer cell line" is used
interchangeably with the term "derived stable producer cell line" and the term "stable
producer cell line".

Preferably the derived producer cell line includes but is not limited to a retroviral and/or a
lentiviral producer cell. 

Preferably the derived producer cell line is an HIV or EIAV producer cell line, more 
preferably an EIAV producer cell line.

Preferably the envelope protein sequences and nucleocapsid sequences are all stably
integrated in the producer and/or packaging cell. However, one or more of these
sequences could also exist in episomal form and gene expression could occur from the
episome.

PACKAGING CELL 

As used herein, the term "packaging cell" refers to a cell which contains those elements 
necessary for production of infectious recombinant virus which are lacking in a
recombinant viral vector. Typically, such packaging cells contain one or more expression 
cassettes which are capable of expressing viral structural proteins (such as gag, pol and 
env) but they do not contain a packaging signal. 

The term "packaging signal", which is referred to interchangeably as "packaging sequence"
or "psi", is used in reference to the non-coding sequence required for encapsidation of
retroviral RNA strands during viral particle formation.

Packaging cell lines suitable for use with the above-described vector constructs may be
readily prepared (see also WO 92/05266), and utilised to create producer cell lines for the
production of retroviral vector particles. As already mentioned, a summary of the
available packaging lines is presented in "Retroviruses" (1997 Cold Spring Harbor
Laboratory Press Eds: JM Coffin, SM Hughes, HE Varmus pp 449).

The packaging cell lines are useful for providing the gene products necessary to
encapsidate and provide a membrane protein for the production of a high titre regulated
retrovirus vector and a regulated nucleic acid gene delivery vehicle. When regulated
retrovirus sequences are introduced into the packaging cell lines, such sequences are
encapsidated with the nucleocapsid (gag/pol) proteins and these units then bud through the
cell membrane to become surrounded in cell membrane and to contain the envelope protein
produced in the packaging cell line. These infectious regulated retroviruses are useful as
infectious units per se or as gene delivery vectors.

The packaging cell may be a cell cultured in vitro such as a tissue culture cell line. 
Suitable cell lines include but are not limited to mammalian cells such as murine 
fibroblast derived cell lines or human cell lines. Preferably the packaging cell line is a 
human cell line, such as for example: HEK293, 293-T, TE671, HT1080. 

Alternatively, the packaging cell may be a cell derived from the individual to be treated 
such as a monocyte, macrophage, blood cell or fibroblast. The cell may be isolated from 
an individual and the packaging and vector components administered ex vivo followed by 
re-administration of the autologous packaging cells. 

Methods for introducing retroviral packaging and vector components into 
packaging/producer cells are described in the present invention. 

Preferably the method of the present invention utilises a recombinase assisted mechanism. 

Preferably the method of the present invention utilises a recombinase assisted mechanism 
which facilitates the production of high titre regulated retroviral vectors from the 
producer cells of the present invention. 

RECOMBINASE ASSISTED MECHANISM

As used herein, the term "recombinase assisted system" includes but is not limited to a 
system using the Cre recombinase / loxP recognition sites of bacteriophage P1 or the
site-specific FLP recombinase of S. cerevisiae which catalyses recombination events between
34 bp FLP recognition targets (FRTs).

The site-specific FLP recombinase of S. cerevisiae which catalyses recombination events
between 34 bp FLP recognition targets (FRTs) has been configured into DNA constructs
in order to generate high level producer cell lines using recombinase-assisted
recombination events (Karreman et al. (1996) NAR 24, 1616-1624). A similar system
has been developed using the Cre recombinase / loxP recognition sites of bacteriophage
P1. This was configured into a retroviral genome such that high titre retroviral producer
cell lines were generated (Vanin et al. (1997) J Virol 71, 7820-7826). However, the use
of the second method (Vanin et al ibid) has centered around the exchange of the central
portions of a retroviral cassette using a recombinase-assisted system. Moreover, these
methods have used genes encoding selectable markers such as neo^R and puro^R (Vanin et
al ibid) and luciferase and puro^R linked by an IRES sequence (Karreman et al ibid).

Karreman and Vanin do not demonstrate or suggest that: (i) a regulated or inactive 3'U3
sequence of the 3'LTR can be introduced into a producer cell via a recombinase-assisted
mechanism or (ii) therapeutic genes under the control of a regulated LTR may be
introduced into a producer cell line via a recombinase assisted step. Vanin et al ibid
suggests that their Cre-mediated recombination approach to retroviral producer cell line
production may be used in combination with other modifications which should result in
improved vector performance. Vanin et al ibid also suggests that their approach provides a
means to generate high titre SIN vectors. However, there is no worked example and in
fact no enabling disclosure because the skilled person would not have been aware, on the
basis of the Vanin et al paper, of the necessary modifications to make the suggested
approach work. Vanin et al makes no reference to hypoxic regulated vectors and/or
regulated/inactivated lentiviral vectors.



LTRs 

As already indicated, each retroviral genome comprises genes called gag, pol and env
which code for virion proteins and enzymes. In the provirus, these genes are flanked at
both ends by regions called long terminal repeats (LTRs). The LTRs are responsible for
proviral integration and transcription. They also serve as enhancer-promoter sequences.
In other words, the LTRs can control the expression of the viral genes. Encapsidation of
the retroviral RNAs occurs by virtue of a psi sequence located at the 5' end of the viral
genome.

As used herein, the term "long terminal repeat (LTR)" is used in reference to domains of
base pairs located at the end of retroviral DNAs.

The LTRs themselves are identical sequences that can be divided into three elements, 
which are called U3, R and U5. U3 is derived from the sequence unique to the 3' end of
the RNA. R is derived from a sequence repeated at both ends of the RNA and U5 is 
derived from the sequence unique to the 5' end of the RNA. The sizes of the three 
elements can vary considerably among different retroviruses. 



For ease of understanding, simple, generic structures (not to scale) of the RNA and the
DNA forms of the MLV retroviral genome are presented in Figure 7 in which the
elementary features of the LTRs and the relative positioning of gag/pol and env are
indicated. Please note that (i) gag/pol and env are normally not spaced apart; and (ii) the
overlap normally present between the pol and env genes and the poly A tail normally
present at the 3' end of the RNA transcript are not illustrated in Figure 7.

As shown in Figure 7, the basic molecular organisation of an infectious retroviral RNA
genome is (5') R - U5 - gag/pol, env - U3 - R (3'). In a defective retroviral vector genome
gag, pol and env may be absent or not functional. The R regions at both ends of the
RNA are repeated sequences. U5 and U3 represent unique sequences at the 5' and 3'
ends of the RNA genome respectively.



Upon cellular transduction, reverse transcription of the virion RNA into double stranded
DNA takes place in the cytoplasm and involves two jumps of the reverse transcriptase
from the 5' terminus to the 3' terminus of the template molecule. The result of these
jumps is a duplication of sequences located at the 5' and 3' ends of the virion RNA.
These sequences then occur fused in tandem on both ends of the viral DNA, forming the
long terminal repeats (LTRs) which comprise R, U5 and U3 regions. On completion of
the reverse transcription, the viral DNA is translocated into the nucleus where the linear
copy of the retroviral genome, called a preintegration complex (PIC), is randomly
inserted into chromosomal DNA with the aid of the virion integrase to form a stable
provirus. The number of possible sites of integration into the host cellular genome is very
large and very widely distributed.
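
The duplication described above can be pictured with a short Python sketch. It is a simplified model resting on stated assumptions: the virion RNA is reduced to labelled R, U5, body and U3 parts, and the names VectorRNA and reverse_transcribe are hypothetical. It shows only the consequence that matters here, namely that both LTRs of the resulting provirus carry the U3 taken from the 3' end of the RNA.

```python
from typing import List, NamedTuple


class VectorRNA(NamedTuple):
    """Virion RNA laid out as (5') R - U5 - body - U3 - R (3')."""
    r: str           # label of the R repeat present at both ends
    u5: str          # label of the unique 5' sequence
    body: List[str]  # everything between U5 and U3 (psi, NOI, ...)
    u3: str          # label of the unique 3' sequence


def reverse_transcribe(rna: VectorRNA) -> List[str]:
    """Return the provirus as an ordered list of element labels.

    Each LTR is assembled as U3-R-U5, with U3 copied from the 3' end of
    the RNA and U5 copied from the 5' end, so the two LTRs are identical.
    """
    ltr = [f"U3:{rna.u3}", f"R:{rna.r}", f"U5:{rna.u5}"]
    return ltr + list(rna.body) + ltr


# Hypothetical vector whose 3' U3 carries a hypoxia response element (HRE).
hre_vector = VectorRNA(r="MLV", u5="MLV", body=["psi", "NOI"], u3="HRE")
provirus = reverse_transcribe(hre_vector)
print(provirus)
```

With an HRE-containing 3' U3, the sketch returns a provirus whose 5' LTR also begins with the HRE-containing U3. This is the situation that limits genome expression when a regulated vector is introduced into a producer cell by transduction.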

Preferably the retroviral genome is introduced into packaging cell lines using retroviral
transduction. 

Preferably retroviral vector particles (such as MLV vector particles) are prepared in a 
transient expression system with a different envelope pseudotype to the packaging cell, 
and used to transduce a retroviral packaging cell. 

Preferably the retroviral transduction step identifies retroviral insertions in integration 
sites that support high level expression of the resulting regulated retroviral genome. 

Preferably stable transduced producer cell lines made by this initial retroviral transduction 
step produce retrovirus at titres of at least 10^6 per ml, such as from about 10^6 to about 10^7
per ml, more preferably at least about 10^7 per ml.

HIGH TITRE 

As used herein, the term "high titre" means an effective amount of a retroviral vector or 
particle which is capable of transducing a target site such as a cell. 

As used herein, the term "effective amount" means an amount of a regulated retroviral or 
lentiviral vector or vector particle which is sufficient to induce expression of an NOI at a
target site. 

Preferably the titre is at least 10^6 retrovirus particles per ml, such as from about 10^6
to about 10^7 per ml, more preferably at least about 10^7 retrovirus particles per ml.

TRANSCRIPTIONAL CONTROL 

The control of proviral transcription remains largely with the noncoding sequences of the
viral LTR. The site of transcription initiation is at the boundary between U3 and R in the 
left hand side LTR (as shown in Figure 7) and the site of poly (A) addition (termination) 
is at the boundary between R and U5 in the right hand side LTR (as shown in Figure 7). 
The 3'U3 sequence contains most of the transcriptional control elements of the provirus,
which include the promoter and multiple enhancer sequences responsive to cellular and in
some cases, viral transcriptional activator proteins.
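
The initiation and termination rule just described can be sketched in Python as follows. This is a toy model under stated assumptions: the provirus is a list of labels of the form "U3:x", "R:x" and "U5:x", the function name transcribe is hypothetical, and transcription is taken to start at the first R element and to stop at the end of the next R element downstream.

```python
from typing import List


def transcribe(provirus: List[str]) -> List[str]:
    """Return the RNA genome read from a provirus given as element labels.

    Initiation is placed at the U3/R boundary of the left-hand LTR and
    poly(A) addition at the R/U5 boundary of the next LTR downstream.
    """
    r_positions = [i for i, e in enumerate(provirus) if e.startswith("R:")]
    if len(r_positions) < 2:
        raise ValueError("provirus needs an R element in each of two LTRs")
    start, stop = r_positions[0], r_positions[1]
    # The slice drops the 5' U3 and the 3' U5, as in (5') R-U5-...-U3-R (3').
    return provirus[start: stop + 1]


# Hypothetical producer cell provirus whose two LTRs differ in U3.
producer_provirus = [
    "U3:MLV", "R:MLV", "U5:MLV",
    "psi", "NOI",
    "U3:HRE", "R:MLV", "U5:MLV",
]
rna = transcribe(producer_provirus)
print(rna)
```

In this example the 5' U3 (here labelled MLV) drives transcription but is absent from the transcript, whereas the 3' U3 (here labelled HRE) is carried in the RNA.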



REGULATABLE LTRs

An LTR present, for example, in the construct of the invention and as a 3'LTR in the
provirus of the producer cell of the invention may be a native LTR or a heterologous 
regulatable LTR. It may also be a transcriptionally quiescent LTR for use in SIN vector 
technology. 

As used herein, the terms "regulatable LTR" and "regulatable 3'LTR" include vectors
which contain responsive elements which are present in retroviral 3' LTR sequences,
either by design or by their nature. As used herein, vectors comprising a "regulatable
3' LTR" are referred to as "regulated retroviral vectors". Within the regulatable 3' LTR
region, the 3'U3 sequence contains most of the transcriptional control elements of the
provirus, which include the promoter and multiple enhancer sequences responsive to
cellular and in some cases, viral transcriptional activator proteins.

Responsive elements include but are not limited to elements which comprise, for 
example, promoter and multiple enhancer sequences responsive to cellular and in some
cases, viral transcriptional activator proteins and/or elements which have been modified 
to render them inactive. As used herein, the term "modified" includes but is not limited 
to silencing, disabling, mutating, deleting or removing all of the U3 sequence or a part 
thereof. 

The term "regulated LTR" also includes an inactive LTR such that the resulting provirus
in the target cell cannot produce a packageable viral genome (self-inactivating (SIN)
vector technology); see the Examples and Figure 6 for a particular embodiment.



ENHANCER

As used herein, the term "enhancer" includes a DNA sequence which binds other protein 
components of the transcription initiation complex and thus facilitates the initiation of 
transcription directed by its associated promoter.

In one preferred embodiment of the present invention, the enhancer is an ischaemia like
response element (ILRE).

ILRE

The term "ischaemia like response element" - otherwise written as ILRE - includes an 
element that is responsive to or is active under conditions of ischaemia or conditions that 
are like ischaemia or are caused by ischaemia. By way of example, conditions that are 
like ischaemia or are caused by ischaemia include hypoxia and/or low glucose
concentration(s). 

The term "hypoxia" means a condition under which a particular organ or tissue receives 
an inadequate supply of oxygen. 

Ischaemia can be an insufficient supply of blood to a specific organ or tissue. A 
consequence of decreased blood supply is an inadequate supply of oxygen to the organ or 
tissue (hypoxia). Prolonged hypoxia may result in injury to the affected organ or tissue. 

A preferred ILRE is a hypoxia response element (HRE).

HRE 

In one preferred aspect of the present invention, there is hypoxia or ischaemia regulatable 
expression of the retroviral vector components. In this regard, hypoxia is a powerful
regulator of gene expression in a wide range of different cell types and acts by the
induction of the activity of hypoxia-inducible transcription factors such as hypoxia
inducible factor-1 (HIF-1; Wang & Semenza 1993 Proc Natl Acad Sci 90:430), which
bind to cognate DNA recognition sites, the hypoxia-responsive elements (HREs) on
various gene promoters. Dachs et al (1997 Nature Med 5: 515) have used a multimeric
form of the HRE from the mouse phosphoglycerate kinase-1 (PGK-1) gene (Firth et al
1994 Proc Natl Acad Sci 91:6496-6500) to control expression of both marker and
therapeutic genes by human fibrosarcoma cells in response to hypoxia in vitro and within
solid tumours in vivo (Dachs et al ibid).

Hypoxia response enhancer elements (HREEs) have also been found in association with a
number of genes including the erythropoietin (EPO) gene (Madan et al 1993 Proc Natl
Acad Sci 90: 3928; Semenza and Wang 1992 Mol Cell Biol 12: 5447-5454). Other
HREEs have been isolated from regulatory regions of the muscle glycolytic enzyme
pyruvate kinase (PKM) gene (Takenaka et al 1989 J Biol Chem 264: 2363-2367), the
human muscle-specific β-enolase gene (ENO3; Peshavaria and Day 1991 Biochem J 275:
427-433) and the endothelin-1 (ET-1) gene (Inoue et al 1989 J Biol Chem 264:
14954-14959).

Preferably the HRE of the present invention is selected from, for example, the
erythropoietin HRE element (HREE1), muscle pyruvate kinase (PKM) HRE element,
phosphoglycerate kinase (PGK) HRE, β-enolase (enolase 3; ENO3) HRE element,
endothelin-1 (ET-1) HRE element and metallothionein II (MTII) HRE element.

RESPONSIVE ELEMENT 

Preferably the ILRE is used in combination with a transcriptional regulatory element,
such as a promoter, which transcriptional regulatory element is preferably active in one or
more selected cell type(s), preferably being only active in one cell type.

As outlined above, this combination aspect of the present invention is called a responsive 
element. 

Preferably the responsive element comprises at least the ILRE as herein defined.

Non-limiting examples of such a responsive element are presented as OBHRE1 and
XiaMac. Another non-limiting example includes the ILRE in use in conjunction with an
MLV promoter and/or a tissue restricted ischaemic responsive promoter. These
responsive elements are disclosed in WO 99/15684.

Other examples of suitable tissue restricted promoters/enhancers are those which are 
highly active in tumour cells such as a promoter/enhancer from a MUC1 gene, a CEA
gene or a 5T4 antigen gene. The alpha fetoprotein (AFP) promoter is also a tumour- 
specific promoter. One preferred promoter-enhancer combination is a human 
cytomegalovirus (hCMV) major immediate early (MIE) promoter/enhancer combination. 

PROMOTER 

The term "promoter" is used in the normal sense of the art, e.g. an RNA polymerase 
binding site. 

The promoter may be located in the retroviral 5' LTR to control the expression of a
cDNA encoding an NOI.

Preferably the NOI is capable of being expressed from the retrovirus genome such as 
from endogenous retroviral promoters in the long terminal repeat (LTR).

Preferably the NOI is expressed from a heterologous promoter to which the heterologous 
gene or sequence is operably linked. 

Alternatively, the promoter may be an internal promoter. 

Preferably the NOI is expressed from an internal promoter. 

Vectors containing internal promoters have also been widely used to express multiple 
genes. An internal promoter makes it possible to exploit promoter/enhancer 
combinations other than those found in the viral LTR for driving gene expression. 
Multiple internal promoters can be included in a retroviral vector and it has proved
possible to express at least three different cDNAs each from its own promoter (Overell et
al 1988 Mol Cell Biol 8: 1803-1808). Internal ribosomal entry site (IRES) elements have
also been used to allow translation of multiple coding regions from either a single mRNA
or from fusion proteins that can then be expressed from an open reading frame.

TISSUE SPECIFIC PROMOTERS 

The promoter of the present invention may be constitutively efficient, or may be tissue or 
temporally restricted in its activity.

Preferably the promoter is a constitutive promoter such as CMV. 

Preferably the promoters of the present invention are tissue specific. 

That is, they are capable of driving transcription of an NOI or NOIs in one tissue while
remaining largely "silent" in other tissue types. 

The term "tissue specific" also covers a promoter which is not restricted in activity to a single
tissue type but which nevertheless shows selectivity in that it may be active in one
group of tissues and less active or silent in another group.

The level of expression of an NOI or NOIs under the control of a particular promoter may 
be modulated by manipulating the promoter region. For example, different domains 
within a promoter region may possess different gene regulatory activities. The roles of 
these different regions are typically assessed using vector constructs having different
variants of the promoter with specific regions deleted (that is, deletion analysis). This 
approach may be used to identify, for example, the smallest region capable of conferring 
tissue specificity or the smallest region conferring hypoxia sensitivity. 
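
The logic of deletion analysis can be illustrated with a short sketch. This is a toy model only: the promoter string, the motif and the `toy_assay` function are hypothetical stand-ins for real constructs and reporter assays and are not taken from the present disclosure. Each contiguous fragment represents a construct in which the flanking sequence has been deleted, and the search returns the shortest fragment that still scores as active.

```python
from typing import Callable, Optional, Tuple


def smallest_active_region(
    promoter: str, is_active: Callable[[str], bool]
) -> Optional[Tuple[int, int]]:
    """Return 0-based (start, end) of the shortest fragment that is still active.

    Every contiguous fragment stands for a deletion construct; fragments are
    tried from shortest to longest, so the first hit is a minimal region.
    """
    n = len(promoter)
    for length in range(1, n + 1):
        for start in range(0, n - length + 1):
            if is_active(promoter[start:start + length]):
                return start, start + length
    return None


# Hypothetical stand-in for a reporter assay: a fragment counts as "active"
# if it still contains a made-up response motif.
TOY_MOTIF = "TACGTGCT"


def toy_assay(fragment: str) -> bool:
    return TOY_MOTIF in fragment


# Made-up promoter: flanking sequence on either side of the motif.
toy_promoter = "GGATCC" + TOY_MOTIF + "AAGCTT"
region = smallest_active_region(toy_promoter, toy_assay)
```

In practice the assay step would be an experimental measurement (for example reporter expression under hypoxia), so the exhaustive search shown here would be replaced by a chosen panel of deletion constructs.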

A number of tissue specific promoters, described above, may be particularly
advantageous in practising the present invention. In most instances, these promoters may 
be isolated as convenient restriction digestion fragments suitable for cloning in a selected 
vector. Alternatively, promoter fragments may be isolated using the polymerase chain
reaction. Cloning of the amplified fragments may be facilitated by incorporating
restriction sites at the 5' end of the primers. 

The NOI or NOIs may be under the expression control of an expression regulatory 
element, such as a promoter and enhancer. 

Preferably the ischaemic responsive promoter is a tissue restricted ischaemic responsive 
promoter. 

Preferably the tissue restricted ischaemic responsive promoter is a macrophage specific
promoter restricted by repression. 

Preferably the tissue restricted ischaemic responsive promoter is an endothelium specific 
promoter. 

Preferably the regulated retroviral vector of the present invention is an ILRE regulated 
retroviral vector. 

Preferably the regulated retroviral vector of the present invention is an ILRE regulated 
lentiviral vector. 

Preferably the regulated retroviral vector of the present invention is an autoregulated 
hypoxia responsive lentiviral vector. 

Preferably the regulated retroviral vector of the present invention is regulated by glucose 
concentration. 

For example, the glucose-regulated proteins (grp's) such as grp78 and grp94 are highly
conserved proteins known to be induced by glucose deprivation (Attenello and Lee 1984
Science 226: 187-190). The grp78 gene is expressed at low levels in most normal healthy
tissues under the influence of basal level promoter elements but has at least two critical
"stress inducible regulatory elements" upstream of the TATA element (Attenello 1984
ibid; Gazit et al 1995 Cancer Res 55: 1660-1663). Attachment to a truncated 632 base
pair sequence of the 5' end of the grp78 promoter confers high inducibility to glucose
deprivation on reporter genes in vitro (Gazit et al 1995 ibid). Furthermore, this promoter
sequence in retroviral vectors was capable of driving a high level expression of a reporter
gene in tumour cells in murine fibrosarcomas, particularly in central relatively
ischaemic/fibrotic sites (Gazit et al 1995 ibid).

Preferably the regulated retroviral vector of the present invention is a self-inactivating 
(SIN) vector. 

By way of example, self-inactivating retroviral vectors have been constructed by deleting
the transcriptional enhancers or the enhancers and promoter in the U3 region of the 3'
LTR. After a round of vector reverse transcription and integration, these changes are
copied into both the 5' and the 3' LTRs producing a transcriptionally inactive provirus
(Yu et al 1986 Proc Natl Acad Sci 83: 3194-3198; Dougherty and Temin 1987 Proc Natl
Acad Sci 84: 1197-1201; Hawley et al 1987 Proc Natl Acad Sci 84: 2406-2410; Yee et al
1987 Proc Natl Acad Sci 91: 9564-9568). However, any promoter(s) internal to the LTRs
in such vectors will still be transcriptionally active. This strategy has been employed to
eliminate effects of the enhancers and promoters in the viral LTRs on transcription from
internally placed genes. Such effects include increased transcription (Jolly et al 1983
Nucleic Acids Res 11: 1855-1872) or suppression of transcription (Emerman and Temin
1984 Cell 39: 449-467). This strategy can also be used to eliminate downstream
transcription from the 3' LTR into genomic DNA (Herman and Coffin 1987 Science 236:
845-848). This is of particular concern in human gene therapy where it is of critical
importance to prevent the adventitious activation of an endogenous oncogene.
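
The way a 3' U3 deletion ends up in both proviral LTRs can be sketched as a small data model. This is an illustrative simplification under stated assumptions: the `LTR` class, the string labels and the activity rule are hypothetical and are not part of the disclosure. The only biology encoded is that, after reverse transcription, the U3 region of both LTRs derives from the 3' LTR of the vector while U5 derives from the 5' LTR.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class LTR:
    u3: str  # carries the viral enhancers/promoter, or a deleted form
    r: str
    u5: str


def provirus_ltrs(vector_5_ltr: LTR, vector_3_ltr: LTR) -> Tuple[LTR, LTR]:
    """Model the two LTRs of the integrated provirus.

    U3 of both proviral LTRs is copied from the vector's 3' LTR;
    R and U5 are taken from the vector's 5' LTR.
    """
    rebuilt = LTR(u3=vector_3_ltr.u3, r=vector_5_ltr.r, u5=vector_5_ltr.u5)
    return rebuilt, rebuilt


def ltr_is_transcriptionally_active(ltr: LTR) -> bool:
    # Toy rule (assumption): an LTR drives transcription only if U3 is intact.
    return ltr.u3 == "U3"


wild_type = LTR("U3", "R", "U5")
sin_3_ltr = LTR("U3-deleted", "R", "U5")  # enhancer/promoter deletion in 3' U3

# The vector keeps an intact 5' LTR, but the provirus inherits the deletion
# in both of its LTRs.
five_prime, three_prime = provirus_ltrs(wild_type, sin_3_ltr)
```

An internal promoter is not part of either LTR in this model, which mirrors the statement above that promoters internal to the LTRs remain active in such vectors.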

RETROVIRAL VECTORS 

The regulated retroviral vector of the present invention includes but is not limited to:
murine leukemia virus (MLV), human immunodeficiency virus (HIV), equine infectious
anaemia virus (EIAV), feline immunodeficiency virus (FIV), caprine arthritis-encephalitis
virus (CAEV), mouse mammary tumour virus (MMTV), Rous sarcoma virus (RSV),
Fujinami sarcoma virus (FuSV), Moloney murine leukemia virus (Mo-MLV), FBR
murine osteosarcoma virus (FBR MSV), Moloney murine sarcoma virus (Mo-MSV),
Abelson murine leukemia virus (A-MLV), Avian myelocytomatosis virus-29 (MC29),
and Avian erythroblastosis virus (AEV).

A detailed list of retroviruses may be found in Coffin et al ("Retroviruses" 1997 Cold 
Spring Harbor Laboratory Press Eds: JM Coffin, SM Hughes, HE Varmus pp 758-763).

Preferred vectors for use in accordance with the present invention are retroviral vectors, 
such as MLV vectors. 

Preferably the recombinant retroviral vectors of the present invention are lentiviral
vectors, more preferably HIV or EIAV vectors. 

LENTIVIRAL VECTORS

The lentiviruses can be divided into primate and non-primate groups. Examples of
primate lentiviruses include but are not limited to: the human immunodeficiency virus
(HIV), the causative agent of human acquired immunodeficiency syndrome (AIDS), and the
simian immunodeficiency virus (SIV). The non-primate lentiviral group includes the
prototype "slow virus" visna/maedi virus (VMV), as well as the related caprine
arthritis-encephalitis virus (CAEV), equine infectious anaemia virus (EIAV) and the more recently
described feline immunodeficiency virus (FIV) and bovine immunodeficiency virus
(BIV).

A distinction between the lentivirus family and other types of retroviruses is that 
lentiviruses have the capability to infect both dividing and non-dividing cells (Lewis et al
1992 EMBO J 11: 3053-3058; Lewis and Emerman 1994 J Virol 68: 510-516). In
contrast, other retroviruses - such as MLV - are unable to infect non-dividing cells such as 
those that make up, for example, muscle, brain, lung and liver tissue. 

Preferred vectors for use in accordance with the present invention are recombinant 
retroviral vectors, in particular recombinant lentiviral vectors, in particular minimal
lentiviral vectors which are disclosed in WO 99/32646 and in WO 98/17815.

VECTOR

As used herein, a "vector" denotes a tool that allows or facilitates the transfer of an entity
from one environment to another. In accordance with the present invention, and by way 
of example, some vectors used in recombinant DNA techniques allow entities, such as a 
segment of DNA (such as a heterologous DNA segment, such as a heterologous cDNA 
segment), to be transferred into a target cell. Optionally, once within the target cell, the 
vector may then serve to maintain the heterologous DNA within the cell or may act as a 
unit of DNA replication. Examples of vectors used in recombinant DNA techniques 
include plasmids, chromosomes, artificial chromosomes or viruses.

OPERABLY LINKED 

The term "operably linked" denotes a relationship between a regulatory region (typically
a promoter element, but may include an enhancer element) and the coding region of a 
gene, whereby the transcription of the coding region is under the control of the regulatory 
region. 

DERIVABLE 

The term "derivable" is used in its normal sense as meaning a nucleotide sequence such as 
an LTR or a part thereof which need not necessarily be obtained from a vector such as a
retroviral vector but instead could be derived therefrom. By way of example, the sequence 
may be prepared synthetically or by use of recombinant DNA techniques. 

VECTOR PARTICLES 

In the present invention, several terms are used interchangeably. Thus, "virion", "virus", 
"viral particle", "retroviral particle", "retrovirus", and "vector particle" mean virus and 
virus-like particles that are capable of introducing a nucleic acid into a cell through a 
viral-like entry mechanism. Such vector particles can, under certain circumstances, 
mediate the transfer of NOIs into the cells they infect. A retrovirus is capable of reverse
transcribing its genetic material into DNA and incorporating this genetic material into a
target cell's DNA upon transduction. Such cells are designated herein as "target cells".

A vector particle includes the following components: a retrovirus nucleic acid, which may 
contain one or more NOIs, a nucleocapsid encapsidating the nucleic acid, the
nucleocapsid comprising nucleocapsid protein of a retrovirus, and a membrane 
surrounding the nucleocapsid. 

NUCLEOCAPSID 

The term "nucleocapsid" refers to at least the group specific viral core proteins (gag) and 
the viral polymerase (pol) of a retrovirus genome. These proteins encapsidate the
retrovirus-packagable sequences and themselves are further surrounded by a membrane 
containing an envelope glycoprotein. 

Preferably a high titre retroviral vector is produced using a codon optimised gag and a 
codon optimised pol or a codon optimised env. 

CODON OPTIMISATION 

As used herein, the terms "codon optimised" and "codon optimisation" refer to an
improvement in codon usage. By way of example, alterations to the coding sequences for
viral components may improve the sequences for codon usage in the mammalian cells or
other cells which are to act as the producer cells for retroviral vector particle production.
This is referred to as "codon optimisation". Many retroviruses, including HIV and other
lentiviruses, use a large number of rare codons and by changing these to correspond to
commonly used mammalian codons, increased expression of the packaging components
in mammalian producer cells can be achieved. Codon usage tables are known in the art
for mammalian cells, as well as for a variety of other organisms.
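
A minimal sketch of the substitution step is given below. It is illustrative only: the usage table covers four amino acids, its frequencies are made-up placeholder values rather than a published codon usage table, and the function names are hypothetical. Each codon is replaced by the most frequently used synonymous codon, so the encoded amino acid sequence is unchanged.

```python
# Illustrative subset of the genetic code. The frequencies are placeholder
# values (assumption); a real codon usage table for the producer cell
# species would be used in practice.
TOY_CODON_USAGE = {
    "L": {"CTG": 0.40, "CTC": 0.20, "TTA": 0.07, "CTA": 0.07},
    "R": {"CGC": 0.19, "AGA": 0.20, "CGT": 0.08, "AGG": 0.21},
    "K": {"AAG": 0.58, "AAA": 0.42},
    "M": {"ATG": 1.00},
}

# Reverse lookup: codon -> amino acid.
CODON_TO_AA = {
    codon: aa for aa, codons in TOY_CODON_USAGE.items() for codon in codons
}
# Most frequently used codon for each amino acid.
PREFERRED = {
    aa: max(codons, key=codons.get) for aa, codons in TOY_CODON_USAGE.items()
}


def translate(seq: str) -> str:
    """Translate a sequence made only of codons present in the toy table."""
    return "".join(CODON_TO_AA[seq[i:i + 3]] for i in range(0, len(seq), 3))


def codon_optimise(seq: str) -> str:
    """Swap each known codon for the preferred synonymous codon."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    out = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TO_AA.get(codon)
        # Codons absent from the toy table are left unchanged.
        out.append(PREFERRED[aa] if aa is not None else codon)
    return "".join(out)


original = "ATGTTAAAACGT"  # encodes M-L-K-R using some rarely used codons
optimised = codon_optimise(original)
```

A practical implementation would also need to respect overlapping reading frames and any regulatory sequences embedded in the coding region, which this sketch does not attempt.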

Preferably a high titre lentiviral vector is produced using a codon optimised gag and a 
codon optimised pol or a codon optimised env.

Preferably a high titre retroviral vector is produced using a modified and/or extended
packaging signal. 

PACKAGING SIGNAL 

As used herein, the term "packaging signal" or "packaging sequence" refers to sequences
located within the retroviral genome which are required for insertion of the viral RNA 
into the viral capsid or particle. Several retroviral vectors use the minimal packaging 
signal (also referred to as the psi sequence) needed for encapsidation of the viral genome. 
By way of example, this minimal packaging signal encompasses bases 212 to 563 of the 
Mo-MLV genome (Mann et al 1983 Cell 33: 153).
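
For readers handling such coordinates computationally, the sketch below shows how a region quoted as "bases 212 to 563" maps onto a sequence string. It assumes the coordinates are 1-based and inclusive, which is the usual convention for genome annotation but is not stated in the text; the genome used here is a placeholder string, not real Mo-MLV sequence, and the function name is hypothetical.

```python
def extract_region(genome: str, first_base: int, last_base: int) -> str:
    """Return bases first_base..last_base (1-based, inclusive) of genome."""
    if not 1 <= first_base <= last_base <= len(genome):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence the -1 on the start.
    return genome[first_base - 1:last_base]


PSI_FIRST, PSI_LAST = 212, 563  # coordinates quoted in the text

# Placeholder of arbitrary length (assumption), standing in for a genome.
placeholder_genome = "N" * 8000
psi = extract_region(placeholder_genome, PSI_FIRST, PSI_LAST)
```

Under this convention the minimal packaging signal spans 563 - 212 + 1 = 352 bases.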

As used herein, the term "extended packaging signal" or "extended packaging sequence" 
refers to the use of sequences around the psi sequence with further extension into the gag 
gene. The inclusion of these additional packaging sequences may increase the efficiency 
of insertion of vector RNA into viral particles. 

Preferably a high titre lentiviral vector is produced using a modified packaging signal.

Preferably the lentiviral construct is based on an EIAV vector genome where all the
accessory genes are removed except Rev. 

ACCESSORY GENES 

As used herein, the term "accessory genes" refers to a variety of virally encoded accessory
proteins capable of modulating various aspects of retroviral replication and infectivity. 
These proteins are discussed in Coffin et al (ibid) (Chapters 6 and 7). Examples of 
accessory proteins in lentiviral vectors include but are not limited to tat, rev, nef, vpr, vpu, 
vif, vpx. An example of a lentiviral vector useful in the present invention is one which 
has all of the accessory genes removed except rev. 

Preferably the production of lentiviral vector particles is increased by about 10 fold in the 
presence of EIAV Rev.

ENV

If the retroviral component includes an env nucleotide sequence, then all or part of that
sequence can be optionally replaced with all or part of another env nucleotide sequence
such as, by way of example, the amphotropic Env protein designated 4070A or the
influenza haemagglutinin (HA) or the vesicular stomatitis virus G (VSV-G) protein.
Replacement of the env gene with a heterologous env gene is an example of a technique
or strategy called pseudotyping. Pseudotyping is not a new phenomenon and examples
may be found in WO-A-98/05759, WO-A-98/05754, WO-A-97/17457, WO-A-96/09400,
WO-A-91/00047 and Mebatsion et al 1997 Cell 90, 841-847.



In one preferred aspect, the retroviral vector of the present invention has been 
pseudotyped. In this regard, pseudotyping can confer one or more advantages. For 
example, with the lentiviral vectors, the env gene product of the HIV based vectors would 
restrict these vectors to infecting only cells that express a protein called CD4. But if the
env gene in these vectors has been substituted with env sequences from other RNA 
viruses, then they may have a broader infectious spectrum (Verma and Somia 1997 
Nature 389:239-242). By way of example, workers have pseudotyped an HIV based 
vector with the glycoprotein from VSV (Verma and Somia 1997 ibid). 

In another alternative, the Env protein may be a modified Env protein such as a mutant or 
engineered Env protein. Modifications may be made or selected to introduce targeting 
ability or to reduce toxicity or for another purpose (Valsesia-Wittman et al 1996 J Virol 
70: 2056-64; Nilson et al 1996 Gene Therapy 3: 280-6; Fielding et al 1998 Blood 9: 1802 
and references cited therein).

TARGET CELL 



As used herein the term "target cell" simply refers to a cell which the regulated retroviral
vector of the present invention, whether native or targeted, is capable of infecting or 
transducing.

The lentiviral vector particle according to the invention will be capable of transducing
cells which are slowly-dividing, and which non-lentiviruses such as MLV would not be
able to efficiently transduce. Slowly-dividing cells, including certain tumour cells, divide
once in about every three to four days. Although tumours contain rapidly dividing
cells, some tumour cells, especially those in the centre of the tumour, divide infrequently.

Alternatively the target cell may be a growth-arrested cell capable of undergoing cell 
division such as a cell in a central portion of a tumour mass or a stem cell such as a 
haematopoietic stem cell or a CD34-positive cell. 

As a further alternative, the target cell may be a precursor of a differentiated cell such as a 
monocyte precursor, a CD33-positive cell, or a myeloid precursor. 

As a further alternative, the target cell may be a differentiated cell such as a neuron, 
astrocyte, glial cell, microglial cell, macrophage, monocyte, epithelial cell, endothelial
cell, hepatocyte, spermatocyte, spermatid or spermatozoa. 

Target cells may be transduced either in vitro after isolation from a human individual or 
may be transduced directly in vivo. 

NOI

In accordance with the present invention, it is possible to manipulate the viral genome or 
the regulated retroviral vector nucleotide sequence, so that viral genes are replaced or 
supplemented with one or more NOIs which may be heterologous NOIs. 

The term "heterologous" refers to a nucleic acid sequence or protein sequence linked to a
nucleic acid or protein sequence to which it is not naturally linked.

With the present invention, the term NOI (i.e. nucleotide sequence of interest) includes 
any suitable nucleotide sequence, which need not necessarily be a complete naturally
occurring DNA sequence. Thus, the DNA sequence can be, for example, a synthetic 
DNA sequence, a recombinant DNA sequence (i.e. prepared by use of recombinant DNA 
techniques), a cDNA sequence or a partial genomic DNA sequence, including
combinations thereof. The DNA sequence need not be a coding region. If it is a coding
region, it need not be an entire coding region. In addition, the DNA sequence can be in a 
sense orientation or in an anti-sense orientation. Preferably, it is in a sense orientation. 
Preferably, the DNA is or comprises cDNA. 

The NOI(s) may be any one or more of selection gene(s), marker gene(s) and therapeutic 
gene(s). As used herein, the term "selection gene" refers to an NOI which
encodes a selectable marker which may have an enzymatic activity that confers resistance 
to an antibiotic or drug upon the cell in which the selectable marker is expressed. 

SELECTABLE MARKERS 

Many different selectable markers have been used successfully in retroviral vectors.
These are reviewed in "Retroviruses" (1997 Cold Spring Harbor Laboratory Press Eds:
JM Coffin, SM Hughes, HE Varmus pp 444) and include, but are not limited to, the
bacterial neomycin (neo) and hygromycin phosphotransferase genes which confer
resistance to G418 and hygromycin respectively; a mutant mouse dihydrofolate reductase
gene which confers resistance to methotrexate; the bacterial gpt gene which allows cells
to grow in medium containing mycophenolic acid, xanthine and aminopterin; the bacterial
hisD gene which allows cells to grow in medium without histidine but containing
histidinol; the multidrug resistance gene (mdr) which confers resistance to a variety of
drugs; and the bacterial genes which confer resistance to puromycin or phleomycin. All
of these markers are dominant selectable and allow chemical selection of most cells
expressing these genes. Other selectable markers are not dominant in that their use must
be in conjunction with a cell line that lacks the relevant enzyme activity. Examples of
non-dominant selectable markers include the thymidine kinase (tk) gene which is used in
conjunction with tk- cell lines.





Particularly preferred markers are blasticidin and neomycin, optionally operably linked to 
a thymidine kinase coding sequence typically under the transcriptional control of a strong 
viral promoter such as the SV40 promoter.




WO 01/25466 PCT/GB00/03837 

NOIs WITH THERAPEUTIC AND/OR DIAGNOSTIC APPLICATIONS 

In accordance with the present invention, suitable NOI sequences include those that are of
therapeutic and/or diagnostic application such as, but not limited to: sequences
encoding cytokines, chemokines, hormones, antibodies, engineered immunoglobulin-like
molecules, a single chain antibody, fusion proteins, enzymes, immune co-stimulatory
molecules, immunomodulatory molecules, anti-sense RNA, a transdominant negative
mutant of a target protein, a toxin, a conditional toxin, an antigen, a tumour suppressor
protein and growth factors, membrane proteins, vasoactive proteins and peptides,
anti-viral proteins and ribozymes, and derivatives thereof (such as with an associated reporter
group). When included, such coding sequences may be typically operatively linked to a
suitable promoter, which may be a promoter driving expression of a ribozyme(s), or a
different promoter or promoters, such as in one or more specific cell types.

NOIs FOR TREATING CANCER

Suitable NOIs for use in the invention in the treatment or prophylaxis of cancer include
NOIs encoding proteins which: destroy the target cell (for example a ribosomal toxin); act
as tumour suppressors (such as wild-type p53), activators of anti-tumour immune
mechanisms (such as cytokines, co-stimulatory molecules and immunoglobulins) or
inhibitors of angiogenesis; provide enhanced drug sensitivity (such as pro-drug
activation enzymes); indirectly stimulate destruction of the target cell by natural effector
cells (for example, a strong antigen to stimulate the immune system); or convert a precursor
substance to a toxic substance which destroys the target cell (for example a prodrug
activating enzyme).

PRO-DRUG ACTIVATING ENZYMES 

Examples of prodrugs include but are not limited to etoposide phosphate (used with
alkaline phosphatase); 5-fluorocytosine (with cytosine deaminase);
Doxorubicin-N-p-hydroxyphenoxyacetamide (with Penicillin-V-Amidase);
Para-N-bis(2-chloroethyl)aminobenzoyl glutamate (with Carboxypeptidase G2); Cephalosporin
nitrogen mustard carbamates (with β-lactamase); SR4233 (with p450 reductase);
Ganciclovir (with HSV thymidine kinase); mustard pro-drugs (with nitroreductase); and
cyclophosphamide or ifosfamide (with cytochrome p450).

NOIs FOR TREATING HEART DISEASE 

Suitable NOIs for use in the treatment or prevention of ischaemic heart disease include 
NOIs encoding plasminogen activators. Suitable NOIs for the treatment or prevention of 
rheumatoid arthritis or cerebral malaria include genes encoding anti-inflammatory 
proteins, antibodies directed against tumour necrosis factor (TNF) alpha, and
anti-adhesion molecules (such as antibody molecules or receptors specific for adhesion
molecules). 

BYSTANDER EFFECT 

The expression products encoded by the NOIs may be proteins which are secreted from
the cell. Alternatively the NOI expression products are not secreted and are active within
the cell. In either event, it is preferred for the NOI expression product to demonstrate a
bystander effect or a distant bystander effect; that is the production of the expression
product in one cell leading to the killing of additional, related cells, either neighbouring or
distant (e.g. metastatic), which possess a common phenotype. Encoded proteins could
also destroy bystander tumour cells (for example with secreted antitumour
antibody-ribosomal toxin fusion protein), indirectly stimulate destruction of bystander tumour
cells (for example cytokines to stimulate the immune system or procoagulant proteins
causing local vascular occlusion) or convert a precursor substance to a toxic substance
which destroys bystander tumour cells (e.g. an enzyme which activates a prodrug to a
diffusible drug). Also envisaged is the delivery of NOI(s) encoding antisense transcripts or
ribozymes which interfere with expression of cellular genes for tumour persistence (for example
against aberrant myc transcripts in Burkitt's lymphoma or against bcr-abl transcripts in
chronic myeloid leukemia). The use of combinations of such NOIs is also envisaged.



CYTOKINES

The NOI or NOIs of the present invention may also comprise one or more
cytokine-encoding NOIs. Suitable cytokines and growth factors include but are not limited to:
ApoE, Apo-SAA, BDNF, Cardiotrophin-1, EGF, ENA-78, Eotaxin, Eotaxin-2, Exodus-2,
FGF-acidic, FGF-basic, fibroblast growth factor-10 (Marshall 1998 Nature Biotechnology
16: 129), FLT3 ligand (Kimura et al 1997), Fractalkine (CX3C), GDNF, G-CSF, GM-CSF,
GF-β1, insulin, IFN-γ, IGF-I, IGF-II, IL-1α, IL-1β, IL-2, IL-3, IL-4, IL-5, IL-6,
IL-7, IL-8 (72 a.a.), IL-8 (77 a.a.), IL-9, IL-10, IL-11, IL-12, IL-13, IL-15, IL-16, IL-17,
IL-18 (IGIF), Inhibin α, Inhibin β, IP-10, keratinocyte growth factor-2 (KGF-2), KGF,
Leptin, LIF, Lymphotactin, Mullerian inhibitory substance, monocyte colony inhibitory
factor, monocyte attractant protein (Marshall 1998 ibid), M-CSF, MDC (67 a.a.), MDC
(69 a.a.), MCP-1 (MCAF), MCP-2, MCP-3, MCP-4,
MIG, MIP-1α, MIP-1β, MIP-3α, MIP-3β, MIP-4, myeloid progenitor inhibitor factor-1
(MPIF-1), NAP-2, Neurturin, Nerve growth factor, β-NGF, NT-3, NT-4, Oncostatin M,
PDGF-AA, PDGF-AB, PDGF-BB, PF-4, RANTES, SDF1α, SDF1β, SCF, SCGF, stem
cell factor (SCF), TARC, TGF-α, TGF-β, TGF-β2, TGF-β3, tumour necrosis factor
(TNF), TNF-α, TNF-β, TNIL-1, TPO, VEGF, GCP-2, GRO/MGSA, GRO-β, GRO-γ,
HCC1, I-309.

The NOI or NOIs may be under the expression control of an expression regulatory 
element, such as a promoter and/or a promoter enhancer, also known as "responsive
elements" in the present invention. 

VIRAL DELIVERY SYSTEMS

When the regulated retroviral vector particles are used to transfer NOIs into cells which
they transduce, such vector particles are also designated "viral delivery systems" or
"retroviral delivery systems". Viral vectors, including retroviral vectors, have been used
to transfer NOIs efficiently by exploiting the viral transduction process. NOIs cloned into
the retroviral genome can be delivered efficiently to cells susceptible to transduction by a
retrovirus. Through other genetic manipulations, the replicative capacity of the retroviral
genome can be destroyed. The vectors introduce new genetic material into a cell but are
unable to replicate.

The regulated retroviral vector of the present invention can be delivered by viral or non- 
viral techniques. Non-viral delivery systems include but are not limited to DNA
transfection methods. Here, transfection includes a process using a non-viral vector to 
deliver a gene to a target mammalian cell. 

Typical transfection methods include electroporation, DNA biolistics, lipid-mediated
transfection, compacted DNA-mediated transfection, liposomes, immunoliposomes,
lipofectin, cationic agent-mediated, cationic facial amphiphiles (CFAs) (Nature
Biotechnology 1996 14: 556), multivalent cations such as spermine, cationic lipids or
polylysine, 1,2-bis(oleoyloxy)-3-(trimethylammonio)propane (DOTAP)-cholesterol
complexes (Wolff and Trubetskoy 1998 Nature Biotechnology 16: 421) and combinations
thereof.

Viral delivery systems include but are not limited to an adenovirus vector, an
adeno-associated viral (AAV) vector, a herpes viral vector, a retroviral vector, a lentiviral vector,
or a baculoviral vector. These viral delivery systems may be configured as a split-intron
vector. A split intron vector is described in WO 99/15683.

Other examples of vectors include ex vivo delivery systems, which include but are not 
limited to DNA transfection methods such as electroporation, DNA biolistics, lipid- 
mediated transfection, compacted DNA-mediated transfection. 

The vector may be a plasmid DNA vector. Alternatively, the vector may be a 
recombinant viral vector. Suitable recombinant viral vectors include adenovirus vectors, 
adeno-associated viral (AAV) vectors, Herpes-virus vectors, or retroviral vectors, 
lentiviral vectors or a combination of adenoviral and lentiviral vectors. In the case of
viral vectors, gene delivery is mediated by viral infection of a target cell.

If the features of adenoviruses are combined with the genetic stability of retro/lentiviruses
then essentially the adenovirus can be used to transduce target cells to become transient 
retroviral producer cells that could stably infect neighbouring cells. 

PHARMACEUTICAL COMPOSITION 

The present invention also provides a pharmaceutical composition for treating an 
individual by gene therapy, wherein the composition comprises a therapeutically effective 
amount of a regulated retroviral vector according to the present invention. The 
pharmaceutical composition may be for human or animal usage. Typically, a physician 
will determine the actual dosage which will be most suitable for an individual subject and 
it will vary with the age, weight and response of the particular patient. 

The composition may optionally comprise a pharmaceutically acceptable carrier, diluent, 
excipient or adjuvant. The choice of pharmaceutical carrier, excipient or diluent can be 
selected with regard to the intended route of administration and standard pharmaceutical 
practice. The pharmaceutical compositions may comprise as - or in addition to - the 
carrier, excipient or diluent any suitable binder(s), lubricant(s), suspending agent(s), 
coating agent(s), solubilising agent(s), and other carrier agents that may aid or increase 
the viral entry into the target site (such as for example a lipid delivery system). 

Where appropriate, the pharmaceutical compositions can be administered by any one or 
more of: minipumps, inhalation, in the form of a suppository or pessary, topically in the 
form of a lotion, solution, cream, ointment or dusting powder, by use of a skin patch, 
orally in the form of tablets containing excipients such as starch or lactose, or in capsules 
or ovules either alone or in admixture with excipients, or in the form of elixirs, solutions 
or suspensions containing flavouring or colouring agents, or they can be injected 
parenterally, for example intracavernosally, intravenously, intramuscularly or 
subcutaneously. For parenteral administration, the compositions may be best used in the 
form of a sterile aqueous solution which may contain other substances, for example 
enough salts or monosaccharides to make the solution isotonic with blood. For buccal or 
sublingual administration the compositions may be administered in the form of tablets or 
lozenges which can be formulated in a conventional manner. 

DISORDERS 



The present invention is believed to have a wide therapeutic applicability - depending on 
inter alia the selection of the one or more NOIs. 


For example, the present invention may be useful in the treatment of the disorders listed 
in WO-A-98/05635. For ease of reference, part of that list is now provided: cancer, 
inflammation or inflammatory disease, dermatological disorders, fever, cardiovascular 
effects, haemorrhage, coagulation and acute phase response, cachexia, anorexia, acute 
infection, HIV infection, shock states, graft-versus-host reactions, autoimmune disease, 
reperfusion injury, meningitis, migraine and aspirin-dependent anti-thrombosis; tumour 
growth, invasion and spread, angiogenesis, metastases, malignant ascites and malignant 
pleural effusion; cerebral ischaemia, ischaemic heart disease, osteoarthritis, rheumatoid 
arthritis, osteoporosis, asthma, multiple sclerosis, neurodegeneration, Alzheimer's disease, 
atherosclerosis, stroke, vasculitis, Crohn's disease and ulcerative colitis; periodontitis, 
gingivitis; psoriasis, atopic dermatitis, chronic ulcers, epidermolysis bullosa; corneal 
ulceration, retinopathy and surgical wound healing; rhinitis, allergic conjunctivitis, 
eczema, anaphylaxis; restenosis, congestive heart failure, endometriosis, atherosclerosis 
or endosclerosis. 


In addition, or in the alternative, the present invention may be useful in the treatment of 
disorders listed in WO-A-98/07859. For ease of reference, part of that list is now 
provided: cytokine and cell proliferation/differentiation activity; immunosuppressant or 
immunostimulant activity (e.g. for treating immune deficiency, including infection with 
human immune deficiency virus; regulation of lymphocyte growth; treating cancer and 
many autoimmune diseases, and to prevent transplant rejection or induce tumour 
immunity); regulation of haematopoiesis, e.g. treatment of myeloid or lymphoid diseases; 
promoting growth of bone, cartilage, tendon, ligament and nerve tissue, e.g. for healing 
wounds, treatment of burns, ulcers and periodontal disease and neurodegeneration; 
inhibition or activation of follicle-stimulating hormone (modulation of fertility); 
chemotactic/chemokinetic activity (e.g. for mobilising specific cell types to sites of injury 
or infection); haemostatic and thrombolytic activity (e.g. for treating haemophilia and 
stroke); anti-inflammatory activity (for treating e.g. septic shock or Crohn's disease); as 
antimicrobials; modulators of e.g. metabolism or behaviour; as analgesics; treating 
specific deficiency disorders; in treatment of e.g. psoriasis, in human or veterinary 
medicine. 

In addition, or in the alternative, the present invention may be useful in the treatment of 
disorders listed in WO-A-98/09985. For ease of reference, part of that list is now 
provided: macrophage inhibitory and/or T cell inhibitory activity and thus, anti-inflammatory 
activity; anti-immune activity, i.e. inhibitory effects against a cellular 
and/or humoral immune response, including a response not associated with inflammation; 
inhibit the ability of macrophages and T cells to adhere to extracellular matrix 
components and fibronectin, as well as up-regulated fas receptor expression in T cells; 
inhibit unwanted immune reaction and inflammation including arthritis, including 
rheumatoid arthritis, inflammation associated with hypersensitivity, allergic reactions, 
asthma, systemic lupus erythematosus, collagen diseases and other autoimmune diseases, 
inflammation associated with atherosclerosis, arteriosclerosis, atherosclerotic heart 
disease, reperfusion injury, cardiac arrest, myocardial infarction, vascular inflammatory 
disorders, respiratory distress syndrome or other cardiopulmonary diseases, inflammation 
associated with peptic ulcer, ulcerative colitis and other diseases of the gastrointestinal 
tract, hepatic fibrosis, liver cirrhosis or other hepatic diseases, thyroiditis or other 
glandular diseases, glomerulonephritis or other renal and urologic diseases, otitis or other 
oto-rhino-laryngological diseases, dermatitis or other dermal diseases, periodontal 
diseases or other dental diseases, orchitis or epididymo-orchitis, infertility, orchidal 
trauma or other immune-related testicular diseases, placental dysfunction, placental 
insufficiency, habitual abortion, eclampsia, pre-eclampsia and other immune and/or 
inflammatory-related gynaecological diseases, posterior uveitis, intermediate uveitis, 
anterior uveitis, conjunctivitis, chorioretinitis, uveoretinitis, optic neuritis, intraocular 
inflammation, e.g. retinitis or cystoid macular oedema, sympathetic ophthalmia, scleritis, 
retinitis pigmentosa, immune and inflammatory components of degenerative fundus 
disease, inflammatory components of ocular trauma, ocular inflammation caused by 
infection, proliferative vitreo-retinopathies, acute ischaemic optic neuropathy, excessive 
scarring, e.g. following glaucoma filtration operation, immune and/or inflammation 
reaction against ocular implants and other immune and inflammatory-related ophthalmic 
diseases, inflammation associated with autoimmune diseases or conditions or disorders 
where, both in the central nervous system (CNS) or in any other organ, immune and/or 
inflammation suppression would be beneficial, Parkinson's disease, complication and/or 
side effects from treatment of Parkinson's disease, AIDS-related dementia complex, HIV-related 
encephalopathy, Devic's disease, Sydenham chorea, Alzheimer's disease and other 
degenerative diseases, conditions or disorders of the CNS, inflammatory components of 
strokes, post-polio syndrome, immune and inflammatory components of psychiatric 
disorders, myelitis, encephalitis, subacute sclerosing pan-encephalitis, encephalomyelitis, 
acute neuropathy, subacute neuropathy, chronic neuropathy, Guillain-Barré syndrome, 
Sydenham chorea, myasthenia gravis, pseudo-tumour cerebri, Down's Syndrome, 
Huntington's disease, amyotrophic lateral sclerosis, inflammatory components of CNS 
compression or CNS trauma or infections of the CNS, inflammatory components of 
muscular atrophies and dystrophies, and immune and inflammatory related diseases, 
conditions or disorders of the central and peripheral nervous systems, post-traumatic 
inflammation, septic shock, infectious diseases, inflammatory complications or side 
effects of surgery, bone marrow transplantation or other transplantation complications 
and/or side effects, inflammatory and/or immune complications and side effects of gene 
therapy, e.g. due to infection with a viral carrier, or inflammation associated with AIDS, 
to suppress or inhibit a humoral and/or cellular immune response, to treat or ameliorate 
monocyte or leukocyte proliferative diseases, e.g. leukaemia, by reducing the amount of 
monocytes or lymphocytes, for the prevention and/or treatment of graft rejection in cases 
of transplantation of natural or artificial cells, tissue and organs such as cornea, bone 
marrow, organs, lenses, pacemakers, natural or artificial skin tissue. 

INTRODUCTION TO THE EXAMPLES SECTION AND THE FIGURES 


The present invention will now be described only by way of example in which reference 
is made to the following Figures: 

Figure 1 shows an MLV-based transduction method using a Cre/LoxP system as 
described by Vanin et al ibid (1997); 

Figure 2 shows an EIAV-based transduction method using a Cre/Lox system; 

Figure 3 shows an MLV SIN vector construct transduction method with an EIAV/HIV 
genome insertion using a Cre/Lox system; 

Figure 4 shows an MLV-based transduction method with HRE 3' LTR using a Cre/LoxP 
system; 

Figure 5 shows an MLV-based transduction method for MLV SIN vector production 
using a Cre/LoxP system; 

Figure 6 shows an MLV-based transduction method with integration of a complete 
second genome construct using a Cre/LoxP system; 

Figure 7 shows the basic molecular organisation of an RNA genome and a proviral DNA 
genome; 

Figure 8 shows a schematic diagram of pTrap2 and pONY8z-loxP plasmids; 

Figure 9 shows an overall summary of the recombinase method; 

Figure 10a shows a FACS analysis of EV1 packaging cells prior to transduction with 
Trap2 vector; 

Figure 10b shows FACS analysis of EV1 packaging cell line transduced with Trap2 at an 
MOI of 0.3. A 5% top slice of the highest expressers was carried out; 

Figure 11 shows a validation of the method for quantitation of GFP mRNA, relative to 
β-actin. A titration of the total RNA from EV1 clone A was used. The difference in Ct 
values between the two assays is shown on the y axis. The magnitude of the gradient 
must be <0.1 for the method to be valid. The gradient is 0.077, so the method is suitable; 

Figure 12 shows the quantitation of GFP mRNA relative to control β-actin mRNA. EV2 
TD cells are transduced with Trap2 at an MOI of 0.3 and are the calibrator sample with 
the ratio designated 1.0; 

Figure 13 shows FACS analysis of EV1 clone A: 

Figure 13A shows original GFP expression of the clone; 

Figure 13B shows GFP expression 7 days after transfection with Cre recombinase 
(pBS185). Excision frequency is 64%; 

Figure 13C shows recombined clone 4 identified as being negative for GFP; 

Figure 14 shows lacZ expression of transfected cells with and without the addition of the 
Cre recombinase (pBS185). Figure 14 shows EV1 A4 and EV2 D4 clones with and 
without the addition of Cre recombinase (pBS185). The efficiency of the insertion event 
was estimated to be about 12% by computer image analysis; 

Figure 15 shows the structure of pONY8.1Z MLVHyb; 

Figure 16 shows the alignment of leader and gag regions present in vectors pONY4Z, 8Z 
and ATG mutated 8Z vector. The latter is referred to as pONY8ZA. The sequences 
aligned are from the NarI site in the leader to the XbaI site between the EIAV gag 
sequence and the CMV promoter. Sequences in the leader are shown in italic and a space 
is present upstream of the position of the gag ATG; and 

Figure 17 shows a schematic representation of the structure of pONY 8.3G +/- vector 
genome plasmids. 


EXAMPLES 

EXAMPLE 1 

Vanin et al (ibid) describe a recombinase system whereby an initial retroviral 
transduction event introduces retroviral LTRs and expressed gene/s flanked by two 
recombinase target sites (exemplified by loxP) into a cell line. Stable transduced cell lines 
are selected by resistance to the antibiotic neomycin and screened for high expression of 
the expressed gene(s) (see Figure 1). Such cell lines (Cell Line 1) contain retroviral 
insertions in integration sites that support high level expression from the retroviral 
genome. 

The next step involves the transfection of the relevant recombinase expression construct 
(exemplified here by Cre recombinase) into the identified high expressing cell line. The 
expressed gene(s) is/are excised and a single loxP site is retained in the construct (Cell 
Line 2). In this instance, thymidine kinase gene (tk) is used as a negative selectable 
marker in combination with the drug, ganciclovir. The final step involves the 
re-insertion of a therapeutic or marker gene of choice into the single loxP site via a 
Cre-assisted mechanism. Cell lines are identified that have been successfully recombined 
(Cell Line 3) and they will produce retroviruses at the same titre as the parental Cell Line 
1. 
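
The three-step scheme of Example 1 (transduce, excise, re-insert) can be pictured as simple list manipulation. The Python below is a minimal illustrative sketch only, not part of the described method: the element names (`neo`, `tk`, `therapeutic_gene`) and the helper functions `cre_excise` and `cre_insert` are hypothetical, and the model ignores site orientation and reaction reversibility.

```python
# Illustrative model only: an integrated provirus is a list of named elements.
LOXP = "loxP"


def cre_excise(locus):
    """Remove everything between the outermost loxP sites, leaving one loxP."""
    if locus.count(LOXP) < 2:
        raise ValueError("excision needs two loxP sites")
    first = locus.index(LOXP)
    last = len(locus) - 1 - locus[::-1].index(LOXP)
    return locus[:first] + [LOXP] + locus[last + 1:]


def cre_insert(locus, cassette):
    """Insert a cassette at a single loxP site; the insert ends up loxP-flanked."""
    if locus.count(LOXP) != 1:
        raise ValueError("insertion needs exactly one loxP site")
    i = locus.index(LOXP)
    return locus[:i] + [LOXP] + list(cassette) + [LOXP] + locus[i + 1:]


# Cell Line 1: high-expressing integration with loxP-flanked selection genes.
cell_line_1 = ["5'LTR", LOXP, "neo", "tk", LOXP, "3'LTR"]
# Cell Line 2: Cre excises the flanked genes, one loxP site remains.
cell_line_2 = cre_excise(cell_line_1)
# Cell Line 3: a gene of choice is re-inserted at the retained loxP site.
cell_line_3 = cre_insert(cell_line_2, ["therapeutic_gene"])
```

The point of the sketch is that Cell Line 3 occupies the same chromosomal position as Cell Line 1, which is why the text expects a comparable titre.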

EXAMPLE 2 

Figure 2 and Figure 3 describe the production of EIAV or HIV high titre transduced 
producer cell lines. 

Figure 2 shows a minimal EIAV genome construct with the 3' U3 sequences replaced by 
a strong constitutive promoter, CMV. A reporter gene such as a blasticidin resistance gene 
(bsr) is flanked by loxP sites. Virus is made in a transient system and is transduced into 
an EIAV producer cell line and clones identified that maximally express the blast marker 
gene. A line is chosen (termed Cell Line 1) and the marker gene is excised by a Cre 
recombinase-assisted excision event, generating Cell Line 2. 

Construct B comprises two loxP sites which flank an internal expression cassette and also 
the native EIAV 3' LTR. Therefore, this construct is recombined into the cell line such 
that the 5' R and U5 sequences are inherited from the packaging cell line, whereas the 3' 
LTR sequences are wholly derived from the recombined construct. The 3' LTR from 
Cell Line 2 is present downstream of the functional EIAV genome expression construct. 
This CMV-R-U5 module is still transcriptionally active but expression is directed away 
from the EIAV genome. 

Figure 3 shows a further aspect of the invention. Construct C is based on an MLV SIN 
vector, with a deletion in the 3' U3 sequences. The cassette includes an internal CMV 
promoter linked to EIAV R and U5 sequences. This is followed by a blasticidin resistance 
gene (bsr) flanked by two loxP sites. Virus is made in a transient transfection system and 
the genome is transduced into a packaging line. Blast-resistant clones are identified and 
the highest expressing line is chosen for further analysis. This line is transfected with Cre 
recombinase and the blast gene is excised. The last step involves the insertion of construct 
B into the single loxP site. Once again, a complete EIAV 3' LTR is introduced into the 
producer cell line. This leads to a CMV-driven EIAV genome expression cassette with 
the EIAV 3' LTR still located at the 3' end of the genome. Transcriptionally quiescent 
MLV SIN LTRs flank these EIAV sequences. 

EXAMPLE 3 


Figure 4 shows an additional aspect of the invention. Construct D is an MLV-based 
vector with a CMV promoter in the 3' LTR in place of the U3 sequences. Virus is made 
in a transient system and is transduced into a packaging cell line as described previously. 
The neo and TK genes are excised by the action of Cre recombinase and construct E is 
recombined into the single loxP target sequence. The modified MLV 3' LTR including 
the HRE or similarly regulated system is transferred into the packaging cell line by the 
recombinase mechanism. Therefore, the 5' R and U5 sequences are inherited from the 
producer cell line whereas the therapeutic and marker gene/s and regulated 3' LTR are 
inherited from construct E. The final producer cell line is constitutively driven by the 5' 
CMV promoter and will produce high titre retroviral vectors which are regulated in the 
transduced target cells. This approach avoids the derivation of low titre transfected 
producer cell lines or the use of hypoxic conditions or chemical mimics for production 
from traditionally derived transduced producer lines. 

EXAMPLE 4 

Figure 5 shows yet another aspect of the invention. Construct D is an MLV-based vector 
with a CMV promoter in the 3' LTR as previously described. The same process is carried 
out as shown in Figure 4 until the final recombination is performed. Construct F contains 
a deletion in U3 sequences in the 3' LTR and an internal expression cassette comprising a 
promoter and gene sequences. The final cell line containing the Cre-mediated 
recombination will be CMV-driven and will constitutively produce high titre MLV SIN 
vectors. Previously, SIN vectors have not been amenable to production by stable cell line 
producer technology. Instead they have been prepared using transfection-based transient 
expression systems. 

EXAMPLE 5 

Figure 6 shows an MLV-based transduction method with integration of a complete second 
genome construct by a Cre/LoxP system. In this approach, construct 1 (called TRAP1) is 
an MLV vector construct containing an internal CMV promoter operably linked to a 
marker gene (a truncated form of the human low affinity nerve growth factor receptor, 
called LNGFR). The enhancer elements in the 3' U3 sequence have been excised and 
replaced by a 34bp loxP site. Virus stocks are prepared in a transient system and the 
TRAP1 genome is stably transduced into packaging cell lines. 

The modified 3' U3 sequences, including the loxP sequence, are copied from the 3' LTR 
position to the 5' LTR, such that there is little 5' promoter activity. Cell lines are screened 
for high levels of expression of LNGFR protein by fluorescence-activated cell sorter 
(FACS) analysis and clonal lines are derived by standard techniques. A Cre recombinase 
expression plasmid is transfected into the derived cell line to excise all sequences 
between the two loxP sites. Next, cells are negatively selected by FACS for absence of 
LNGFR expression and clonal lines are derived by standard techniques. Construct 2 in 
this example comprises a complete HIV, EIAV or MLV retroviral genome, which 
is flanked by two minimal 34bp loxP recombinase sites. A strong constitutive promoter 
such as CMV directs transcription of the genome. On transfection of plasmid 2 and Cre 
expression plasmid, the complete lentivirus vector or MLV vector genome is inserted in 
the producer cell line. These sequences are flanked to the 5' by a small portion of MLV 
U3 sequence and a loxP site and to the 3' by the second loxP site, enhancerless-U3 
sequences, R and U5 derived from the MLV construct 1. 

Derivation of Plasmid TRAP1 (Figure 6 - Construct 1) 

Oligonucleotides VSAT129 and VSAT130 were synthesised which correspond to the 
minimal 34bp loxP sites and contain a 5' overhang for NheI and a 3' overhang for XbaI. 
The sequences 5' to 3' are as follows: VSAT129 
(CTAGCATAACTTCGTATAATGTATGCTATACGAAGTTATT) (SEQ ID No 49) and VSAT130 
(CTAGAATAACTTCGTATAGCATACATTATACGAAGTTATG) (SEQ ID No 50). 
The two oligonucleotides were treated with T4 polynucleotide kinase and were heated to 
95 °C for 5 minutes, before gradual cooling to room temperature. The annealed and 
kinased oligos were ligated to a 2,830 bp NheI/XbaI fragment from LTR plasmid (SEQ 
ID No 59). Fragments were ligated and correct clones of LTRloxP were identified by 
sequence analysis. Plasmid LTRloxP was then digested with NheI and ScaI and a 
2,185bp fragment was prepared for following cloning steps. 
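
As a sanity check on the oligonucleotide design, the complementarity of the two oligos can be verified in a few lines. The sketch below is illustrative only: the oligo sequences are copied from the text above with extraction spaces removed, the 34bp loxP sequence is the canonical one, and the helper `revcomp` and the variable names are assumptions introduced for the example.

```python
# Oligo sequences as given in the text (5' to 3'), spaces removed.
VSAT129 = "CTAGCATAACTTCGTATAATGTATGCTATACGAAGTTATT"
VSAT130 = "CTAGAATAACTTCGTATAGCATACATTATACGAAGTTATG"
# Canonical 34 bp loxP site: 13 bp repeat, 8 bp spacer, 13 bp inverted repeat.
LOXP = "ATAACTTCGTATAATGTATGCTATACGAAGTTAT"

OVERHANG = 4  # both NheI and XbaI leave a 5'-CTAG overhang


def revcomp(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


# After annealing, each oligo's first 4 nt stay single-stranded;
# the remaining 36 nt of each strand should pair with the other.
duplex_top = VSAT129[OVERHANG:]
duplex_bottom_as_top = revcomp(VSAT130[OVERHANG:])

anneals = duplex_top == duplex_bottom_as_top
contains_loxp = LOXP in VSAT129
overhangs = (VSAT129[:OVERHANG], VSAT130[:OVERHANG])
```

With the sequences as printed, the 36 nt duplex regions are exactly complementary, the top strand carries the full loxP site, and both free ends are CTAG, compatible with NheI- and XbaI-cut vector ends.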


Plasmids TRAP1 and TRAP1G were derived from LTRloxP and the MLV genome 
CGCLNGFR (encodes GFP and LNGFR from an internal CMV promoter - see SEQ ID 
No 57). However, the GFP gene was excised by EcoRI/BsmI digestion and the 6,796bp 
fragment was filled in by T4 DNA polymerase and re-ligated, in order to generate 
plasmid CXCLNGFR. Plasmid TRAP1 was generated by ligation of a 2,185bp NheI/ScaI 
fragment from LTRloxP (see SEQ ID No 58) to a 4,426bp NheI/ScaI fragment from 
CXCLNGFR. Plasmid TRAP1G was generated by ligation of a 2,185bp NheI/ScaI 
fragment from LTRloxP to a 5,179bp NheI/ScaI fragment from CGCLNGFR. 

Derivation of Plasmid pONY8z-lox (Figure 6 - Construct 2) 

In this example, the retroviral genome inserted into the loxP site in Figure 6 was based on 
the EIAV vector genome, pONY8z (for preparation see pONY8.0Z construction below). 
pONY8z was cut with SnaBI and NruI, and the 4358bp fragment purified and self-ligated 
to form pONY8z-shuttle. This plasmid has unique 5' sites (DraIII and BglII) and unique 
3' sites (PvuII and BspLU11I). Oligonucleotides encoding the 34bp loxP sites were 
inserted with suitable base pair overhangs at the unique 5' DraIII site and then the unique 
3' BspLU11I site, to generate plasmid pONY-8z-shuttleloxP. 

Plasmid pONY8z-loxP was made as follows. Plasmid pONY-8z-shuttleloxP was 
digested with BsrGI and NspV, and the 3670bp fragment was purified as the vector 
fragment. The insert for ligation to this fragment was derived from pONY8z by partial 
digestion with BsrGI (two sites) followed by digestion with NspV. A 7,328bp fragment 
was purified and ligated to the 3670bp fragment described above. 

The Cre recombinase plasmid as used in this system is pBS185 (Gibco). 

EXAMPLE 6 

We constructed an MLV self-inactivating (SIN) vector called pTrap2 (see SEQ ID No 56) 
by replacing the 3' U3 NheI-XbaI fragment with a 34-bp loxP sequence. The vector 
transcribes the marker gene GFP from an internal CMV promoter. Trap2 vector was used 
to transduce EIAV packaging cell lines EV1 and EV2. The EV cell lines are based on 
human TE671 cells and express EIAV gag/pol proteins and VSV-G envelope, regulated 
by a temperature-sensitive switch. High expressing clones of transduced EV1 and EV2 
cells were identified by FACS analysis for GFP. Individual clones expressing high levels 
of GFP were then selected. The GFP expression cassette was excised following transient 
transfection with a Cre recombinase expression plasmid. The derived cell line, EV-loxP, 
contains a single loxP site and minimal sequences derived from the MLV construct 
pTrap2. An EIAV genome was engineered to contain loxP sites flanking the entire vector 
genome. 

This genome construct and Cre recombinase were co-transfected into EV-loxP. Stable 
cell lines expressing lacZ were selected by FACS and cell lines were cloned by limiting 
dilution. Therefore, we have introduced an entire EIAV genome expression cassette into a 
single loxP site. This site was previously identified by MLV transduction as highly 
permissive for transgene expression. A 5' CMV promoter transcribes the lentiviral 
genome in the producer cell line but the expression site was originally identified by MLV 
transduction. This method is adaptable to the generation of transduced producer cell lines 
for other lentiviral vector systems. 

Materials and Methods 

Vector construction: Plasmid pTrap2 was made as follows: A plasmid containing a 
single MLV LTR (LTRplasmid - SEQ ID No 59) was digested with NheI and 
XbaI and a 34 bp minimal loxP site was introduced with relevant sticky ends. This 
insertion step removes the MLV U3 enhancer elements which lie within the excised 
NheI-XbaI fragment. The LTR-loxP plasmid was linearised by digestion with NheI and was 
ligated to a 6.8kb NheI fragment from the MLV construct CZCG (See SEQ ID No 55). 
This construct expresses lacZ from the 5' U3 promoter and GFP from an internal CMV 
promoter. The resulting pTrap2 construct is shown in Figure 8. 

The EIAV genome constructs pONY8.0Z and pONY8.1Z were prepared as follows: 

pONY8.0Z construction 


pONY8.0Z was derived from pONY4.0Z (see WO 99/32646) by introducing mutations 
which 1) prevented expression of TAT by an 83nt deletion in the exon 2 of tat, 2) prevented 
S2 ORF expression by a 51nt deletion, 3) prevented REV expression by deletion of a 
single base within exon 1 of rev and 4) prevented expression of the N-terminal portion of 
gag by insertion of T in ATG start codons, thereby changing the sequence to ATTG from 
ATG. With respect to the wild type EIAV sequence Acc. No. U01866 these correspond 
to deletion of nt 5234-5316 inclusive, nt 5346-5396 inclusive and nt 5538. The insertion 
of T residues was after nt 526 and 543. 
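
The stated deletion sizes can be cross-checked against the inclusive coordinate ranges. The following Python is a small arithmetic sketch; the function name `inclusive_length` and the dictionary are introduced here for illustration and the coordinates are those quoted above relative to Acc. No. U01866.

```python
def inclusive_length(start, end):
    """Number of nucleotides in a range whose end points are both included."""
    return end - start + 1


# Coordinates as quoted in the text (wild type EIAV, Acc. No. U01866).
deletions = {
    "tat exon 2": inclusive_length(5234, 5316),
    "S2 ORF": inclusive_length(5346, 5396),
    "rev exon 1": inclusive_length(5538, 5538),
}
```

The ranges give 83 nt, 51 nt and 1 nt respectively, which agrees with the 83nt and 51nt deletions and the single-base deletion described for tat, S2 and rev.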

pONY8.1Z construction 

pONY8.1Z was obtained directly from pONY8.0Z by digestion with SalI and partial 
digestion with SapI. Following restriction the overhanging termini of the DNA were 
made blunt ended by treatment with T4 DNA polymerase. The resulting DNA was then 
religated. This manipulation results in a deletion of sequence between the LacZ reporter 
gene and just upstream of the 3' PPT. The 3' border of the deletion is nt 7895 with 
respect to wild type EIAV, Acc. No. U01866. Thus pONY8.1Z does not contain 
sequences corresponding to the EIAV RREs. 



WO 01/25466 



PCT/GB00/03837 




Plasmid pONY8z was linearised by BglII, and a single loxP site was cloned into BglII, 
immediately upstream of the 5' CMV promoter, to produce pONY8z-loxP. Plasmids 
pONY3.2iresHYG and pHCMV-VSVG were used in the derivation of cell lines EV1 and 
EV2. The plasmid pONY3.2iresHYG was constructed as follows: 

pONY3.2IREShyg 

pONY3.2IREShyg was derived from pONY3.2. pONY3.2 is a derivative of pONY3.1 in 
which expression of TAT and S2 are ablated by an 83nt deletion in the exon 2 of tat and a 
51nt deletion in the S2 ORF. With respect to the wild type EIAV sequence Acc. No. U01866 
these correspond to deletion of nt 5234-5316 inclusive and nt 5346-5396 inclusive. This 
fragment was introduced into the expression vector pHORSE IRES hyg which was made 
as follows. pHORSE (see WO 99/32646) was cut with SnaBI and NotI, which excises a 
fragment running from the CMV promoter through EIAV gag/pol, and this fragment was 
introduced into pIRES1hyg (Clontech) digested with the same enzymes. This plasmid was 
then cut with Sse8387I and BstEII and then ligated with the Sse8387I to BstEII fragment from 
pONY3.2. The sequence of the plasmid is set out in SEQ ID No 51. 

Virus Production 

Transient MLV vector preparations pseudotyped with RD114 cat endogenous envelope 
were made as described previously (Soneoka et al., 1995). EIAV vector was harvested 
from confluent monolayers following 3 days induction of VSV-G expression at 32°C. 
MLV vector preparations were titred in triplicate on HT1080 fibrosarcoma cells. EIAV 
vector preparations were titred by GFP and lacZ on D17 dog osteosarcoma cells. 

Flow cytometry of β-galactosidase and GFP activity: 

1.5x10^5 cells from a 12-well plate were analysed for lacZ expression using the 
FluoReporter lacZ Flow Cytometry kit (Molecular Probes). GFP expression was also 
directly assessed using the FACSCalibur flow cytometer (Becton Dickinson). 

Transfection methods 

Calcium phosphate transfections were carried out using the ProFection kit (Promega) 
according to manufacturer's instructions. 

Results 

Figure 8 shows a schematic diagram of pTrap2 and pONY8z-loxP plasmids used in this 
study. 

Introduction of Trap2 genome into EV1 and EV2 

An overall summary of the process described here is given in Figure 9. Trap2 MLV 
vector was made in a transient system with the amphotropic 4070A envelope. It gave a 
GFP titre of 1.7x10^6 T.U. per ml. Trap2 vector however also gave a lacZ titre of 9.4x10 
T.U. per ml. This shows that replacement of the NheI-XbaI fragment from the MLV U3 
region with loxP does not completely inactivate the MLV U3 promoter. Therefore Trap2, 
as constructed, is a partial SIN vector. 

EV1 and EV2 cells were transduced with Trap2 vector at a multiplicity of infection 
(MOI) of 0.3. This was done to insert single copies of the MLV genome into the 
packaging lines. 
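
The rationale for a low MOI can be illustrated numerically. The sketch below assumes that the number of integrations per cell follows a Poisson distribution with mean equal to the MOI; this is a common modelling assumption and is not stated in the text, and the function and variable names are hypothetical.

```python
import math


def poisson_pmf(k, mean):
    """Probability of exactly k events for a Poisson distribution."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


moi = 0.3  # multiplicity of infection used for Trap2 transduction

p_untransduced = poisson_pmf(0, moi)       # cells with no vector copy
p_single = poisson_pmf(1, moi)             # cells with exactly one copy
p_multi = 1.0 - p_untransduced - p_single  # cells with two or more copies

# Fraction of transduced cells that carry a single copy.
single_among_transduced = p_single / (1.0 - p_untransduced)
```

Under this assumption about 26% of cells are transduced at an MOI of 0.3, and roughly 86% of those carry a single copy, which is consistent with the stated aim of inserting single copies of the MLV genome.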

Derivation of high expressers of GFP marker gene 

Transduced EV1 and EV2 cells were analysed by FACS (see Figure 10) and the top 5% 
of GFP expressing cells were sorted and expanded. Clonal lines were derived by limiting 
dilution and four clones of EV1 and EV2 were chosen by visual inspection. 

A quantitative TaqMan RT-PCR reaction was established in order to identify which of the 
four clones of EV1 and EV2 were the highest expressors of GFP mRNA. Total RNA was 
analysed by RT-PCR for GFP and β-actin. Quantitation was calculated by direct 
comparison of the Ct values (Cycle threshold). This was possible as it was proved that the 
two individual RT-PCR reactions are of similar efficiency (see Figure 11). By identifying an 
optimal chromosomal location for GFP transgene expression, we can ensure that the 
inserted loxP site will be highly permissive for expression of an inserted lentiviral 
genome construct. 

Figure 12 shows the n-fold difference in GFP : β-actin ratio for clones EV1 A to D and 
EV2 A to D. All ratios are defined relative to a calibrator sample, defined as a ratio of 1.0. 
The calibrator sample used was RNA from EV2 cells transduced with Trap2 at an MOI of 
0.3. 

This identified the best expressing lines as: 

• EV1 clone A - GFP : β-actin ratio is 22.8 

• EV2 clone D - GFP : β-actin ratio is 18.6 

These two lines were carried forward for further study. 
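
The Ct-based quantitation described above resembles the widely used comparative Ct calculation, in which a ratio relative to a calibrator is 2 raised to the power of minus the difference in ΔCt. The Python below is a minimal sketch under that assumption; the text does not give the exact formula used, all Ct values shown are hypothetical, and plotting ΔCt against log10 of RNA input for the validation slope is likewise an assumption.

```python
def delta_ct(ct_target, ct_reference):
    """Ct of the target assay (GFP) minus Ct of the reference (beta-actin)."""
    return ct_target - ct_reference


def relative_ratio(sample, calibrator):
    """Fold difference versus the calibrator, assuming ~100% efficient assays.

    sample and calibrator are (Ct_GFP, Ct_actin) tuples.
    """
    ddct = delta_ct(*sample) - delta_ct(*calibrator)
    return 2.0 ** (-ddct)


def slope(xs, ys):
    """Least-squares gradient of ys against xs."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sum((x - mx) ** 2 for x in xs)
    return num / den


# Hypothetical Ct values, for illustration only.
calibrator = (24.0, 18.0)
clone = (19.5, 18.0)
fold = relative_ratio(clone, calibrator)

# Validation in the style of Figure 11: delta-Ct across an RNA titration
# should be nearly flat (gradient magnitude below 0.1).
log10_input = [0.0, 1.0, 2.0, 3.0]
dct_series = [6.0, 6.05, 6.1, 6.2]
gradient = slope(log10_input, dct_series)
method_valid = abs(gradient) < 0.1
```

By construction the calibrator itself returns a ratio of 1.0, matching the convention used for the EV2 calibrator sample.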


Excision of internal expression cassette by Cre recombinase 

The process of retroviral integration copies the loxP-containing modified 3' U3 to the 5' 
position. Therefore, one can excise the majority of the MLV Trap2 integration by the 
action of Cre recombinase. This will leave a single modified LTR, suitable for lentiviral 
genome integration. 

EV1 clone A and EV2 clone D were transfected by the Cre expression plasmid pBS185 
(Life Technologies). After one week, the cells were analysed for GFP by FACS (see 
Figure 13) to determine the excision frequency. This was measured at 20-70% in all lines. 

Recombined clones were identified by cloning cells by limiting dilution and checking by 
microscope and FACS for loss of GFP expression. 


Insertion of EIAV genome into loxP site 

Plasmid pONY8z-loxP and pBS185 were co-transfected using Fugene into EV1 clone A 
(excised) and EV2 clone D (excised). A control transfection of pONY8z-loxP in the 
absence of pBS185 was also carried out. 

Figure 14 shows lacZ expression of transfected cells with and without the addition of Cre 
recombinase (pBS185). The efficiency of the insertion event was estimated to be about 12% by 
computer image analysis. 

We analysed cells for lacZ expression by FACS using the FluoReporter lacZ Flow 
Cytometry kit. The top 5% of lacZ positive cells were sorted by FACS and clones were 
derived by limiting dilution. In total, 12 clones of EVl/A/pONY8z-loxP were derived and 
13 clones of EV2/D/pONY8z-loxP. 

EXAMPLE 7 

Construction of EIAV vectors with LTR driven open reading frames 



The EIAV vector configurations described previously utilise a single promoter-transgene
cassette located internally in the vector. For example, in pONY8Z the
promoter-transgene cassette is CMV-LacZ. However, for some uses it would be
advantageous to have the option of expressing a gene from the 5' LTR promoter as well,
for example a marker gene such as green fluorescent protein (GFP), a resistance marker
such as neomycin phosphotransferase (neo), or another protein or biologically active
entity such as a ribozyme. Previous experiments have shown that the EIAV LTR is
weakly active in human cells in the absence of EIAV tat. However, the transcriptional
activity of the LTR can be increased by replacement of the EIAV U3 region with the
MLV U3 region or the CMV promoter. This is achieved by introducing these alterations
in the 3' LTRs of the vector plasmids. As a result of the replicative strategy of
retroviruses, the modified 3' LTR becomes positioned at the 5' end of the integrated vector
and can thus drive expression of a gene placed downstream of the gag region. To ensure
optimal levels of expression there should preferably be no ATG start codons prior to the
start codon of the gene to be expressed. In pONY8Z the ATG start codon of gag and the
next ATG downstream were mutated to ATTG in order to ablate expression of the
amino-terminal portion of gag present in the vectors; however, there are 7 other ATG
codons further downstream of these, within gag, from which translation might be
initiated.
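Locating upstream ATG triplets and applying the ATG to ATTG change can be illustrated with a short sketch. The helper names and the example fragment are hypothetical; the fragment is not taken from any pONY plasmid.

```python
def atg_positions(seq):
    """Return 0-based start indices of every ATG triplet in seq (any frame)."""
    seq = seq.upper()
    return [i for i in range(len(seq) - 2) if seq[i:i + 3] == "ATG"]


def ablate_atg(seq):
    """Replace every ATG with ATTG, the single-base insertion used in the text."""
    return seq.upper().replace("ATG", "ATTG")


# Hypothetical leader fragment, not taken from any pONY plasmid.
leader = "CCATGGGTGACATGAA"
print(atg_positions(leader))
print(ablate_atg(leader))
```

Scanning in all three frames matters here because any upstream ATG, whatever its frame, could be used to initiate translation.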



Described below are the replacement of the U3 region of EIAV with MLV or CMV
promoters and the mutation of ATG codons in the gag region.

Replacement of the EIAV U3 region with MLV U3 or CMV promoters

The MLV U3 region was introduced into the pONY8Z vector by replacement of the 3' LTR
with a synthetic MLV/EIAV LTR made by the overlapping PCR technique, using the
following primers and templates.

The EIAV PPT/U3 sequence was amplified from pONY8.1Z using primers: 
KM001: CAAAGCATGCCTGCAGGAATTCG (SEQ ID No 1) 



and 

KM003: 

GCCAAACCTACAGGTGGGGTCTTTCATTATAAAACCCCTCATAAAAACCCCAC 
AG (SEQ ID No 2) 

to give the following product: 

CAAAGCATGCCTGCAGGAATTCGATATCAAGCTTATCGATACCGTCGAATTG 

GAAGAGCTTTAAATCCTGGCACATCTCATGTATCAATGCCTCAGTATGTTTAG 

AAAAACAAGGGGGGAACTGTGGGGTTTTTATGAGGGGTTTTATAATGAAAGA 

CCCCACCTGTAGGTTTGGC (SEQ ID No 3) 



The MLV U3 region was amplified from pHIT111 (Soneoka et al., (1995) Nucleic Acids
Res. 23, 628-633) using KM004:
CTGTGGGGTTTTTATGAGGGGTTTTATAATGAAAGACCCCACCTGTAGGTTTG
GC (SEQ ID No 4)
and

KM005:

GAAGGGACTCAGACCGCAGAATCTGAGTGCCCCCCGAGTGAGGGGTTGTGGG
CTCT (SEQ ID No 5)

to give the following product:

CTGTGGGGTTTTTATGAGGGGTTTTATAATGAAAGACCCCACCTGTAGGTTTGGCAAGCTAGCT
TAAGTAACGCCATTTTGCAAGGCATGGAAAAATACATAACTGAGAATAGAGAAGTTCAGATC
AAGGTCAGGAACAGATGGAACAGCTGAATATGGGCCAAACAGGATATCTGTGGTAAGCAGTT
CCTGCCCCGGCTCAGGGCCAAGAACAGATGGAACAGCTGAATATGGGCCAAACAGGATATCT
GTGGTAAGCAGTTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGTCCCCAGATGCGGTCCAGC
CCTCAGCAGTTTCTAGAGAACCATCAGATGTTTCCAGGGTGCCCCAAGGACCTGAAATGACCC
TGTGCCTTATTTGAACTAACCAATCAGTTCGCTTCTCGCTTCTGTTCGCGCGCTTCTGCTCCCCG
AGCTCAATAAAAGAGCCCACAACCCCTCACTCGGGGGGCACTCAGATTCTGCGGTCTGAGTCC
CTTC (SEQ ID No 6)

The MLV U3/EIAV R/U5 was amplified from pONY8.1Z using primers
KM002: GAGCGCAGCGAGTCAGTGAGCGAG (SEQ ID No 7) and
KM006:

AGAGCCCACAACCCCTCACTCGGGGGGCACTCAGATTCTGCGGTCTGAGTCC
CTTC (SEQ ID No 8)

to give the following product:

AGAGCCCACAACCCCTCACTCGGGGGGCACTCAGATTCTGCGGTCTGAGTCCCTTCTCTGCTG
GGCTGAAAAGGCCTTTGTAATAAATATAATTCTCTACTCAGTCCCTGTCTCTAGTTTGTCTGTT
CGAGATCCTACAGAGCTCATGCCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTG
TTATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGC
CTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAAC
CTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGG
CGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTC (SEQ ID No 9)

The PCR products described above were purified and then used as templates in new PCR
reactions to link them together to obtain a 992bp product. The final product contains two
SapI sites which flank the hybrid LTR. These allow introduction of the PCR product into
the corresponding SapI sites present in the pONY8Z or pONY8.1Z vector plasmid,
thereby creating pONY8Z MLVHyb and pONY8.1Z MLVHyb. The sequence of the
hybrid LTR in these plasmids was confirmed by sequencing. The titres obtained from the
vectors in transient transfection assays are shown in Table 1. The titres were very similar
to the titres from the parental constructs pONY8Z and pONY8.1Z, indicating that
replacement of the EIAV U3 region with that of MLV had little or no detrimental effect
on the infectious cycle of the vectors.
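Overlapping PCR relies on the junction primers of adjacent fragments being reverse complements of one another, so that the purified products can anneal and be extended into a single molecule. The sketch below checks that property for the primer sequences listed above; the helper function is illustrative and is not part of the described method.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# Primer sequences as listed in the text (SEQ ID Nos 2, 4, 5 and 8).
KM003 = "GCCAAACCTACAGGTGGGGTCTTTCATTATAAAACCCCTCATAAAAACCCCACAG"
KM004 = "CTGTGGGGTTTTTATGAGGGGTTTTATAATGAAAGACCCCACCTGTAGGTTTGGC"
KM005 = "GAAGGGACTCAGACCGCAGAATCTGAGTGCCCCCCGAGTGAGGGGTTGTGGGCTCT"
KM006 = "AGAGCCCACAACCCCTCACTCGGGGGGCACTCAGATTCTGCGGTCTGAGTCCCTTC"

# Each junction primer pair should be mutually reverse complementary, so the
# adjacent PCR products share an overlap that lets them prime each other.
for left_rev, right_fwd in [(KM003, KM004), (KM005, KM006)]:
    print(reverse_complement(left_rev) == right_fwd)
```

The same check can be applied to the CMV hybrid primers (KM008 to KM011) before ordering oligonucleotides.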

Table 1. Titres obtained from MLV hybrid LTR vector plasmids

vector plasmid          titre (l.f.u./ml) (a)

pONY8Z                  3 x 10^3
pONY8Z MLVHyb           1 x 10"
pONY8.1Z                6 x 10^4
pONY8.1Z MLVHyb         2 x 10^4

(a) Titre was measured on D17 cells and is expressed as LacZ forming units/ml (l.f.u./ml).
Transfections were carried out in 293T cells using the vector plasmid shown and pRV67
(VSV-G expression plasmid), and pONY3.1 (EIAV gag/pol expression plasmid).

The structure of pONY8.1Z MLVHyb is shown in Figure 15 and the sequence of this 
plasmid is shown as SEQ ID No 10. 

The EIAV promoter was also replaced by the human cytomegalovirus (CMV) promoter
using a similar strategy. The primers and templates were the same except that KM003
was replaced by KM008:
GGCCATCGTGCCTCCCCACTCCTGCAGTTATAAAACCCCTCATAAAAACCCCA
CAG (SEQ ID No 11)

KM004 was replaced by KM009: 

CTGTGGGGTTTTTATGAGGGGTTTTATAAACTGCAGGAGTGGGGAGGCACGA 
TGGCC (SEQ ID No 12) 

KM005 was replaced by KM010: 

GAAGGGACTCAGACCGCAGAATCTGAGTGCCCGGTTCACTAAACGAGCTCTG 
CTTATATAGACC (SEQ ID No 13) and

KM006 was replaced by KM011:



GGTCTATATAAGCAGAGCTCGTTTAGTGAACCGGGCACTCAGATTCTGCG 
GTCTGAGTCCCTTC (SEQ ID No 14) 

The template for the PCR reaction with primers KM009 and KM010 was pONY2.1LacZ.
This plasmid contains a single CMV promoter. The combined PCR product of 1319 bp
was digested with SapI and introduced into the pONY8Z or pONY8.1Z backbone as
described above for pONY8Z MLVHyb.



Mutation of remaining ATG codons in the gag of pONY8Z to ATTG 

The alignment of the sequence of the leader and gag region present in vectors pONY4Z
(an earlier generation EIAV vector), pONY8Z and a derivative of pONY8Z in which the
7 remaining ATG codons are mutated to ATTG is shown in Figure 16. These mutations
were created by PCR mutagenesis as follows. The template for the PCR reactions was
pONY8Z and the primers were:

F1: CGAGATCCTACAGTTGGCGCCCGAACAG (SEQ ID No 15);

R1: GAGTTACAATCTTCCAGCAATGGAATGACAATCCCTCAGCTGCCAGTCCTT
TTCTTTTACAAAGTTGGTATCAATGAAATAAGTCTACTAGACTTAGC (SEQ ID
No 16);

F2: TTCCATTGCTGGAAGATTGTAACTCAGACGCTGTCAGGACAAGAAAGAGA
GGCCTTTGAAAGAACATTGGTGGGCAATTTCTGCTGTAAAGATTG (SEQ ID No
17);

R2: CAATATTTCGCTCTTAGGAGCTGGAATGATGCCTTTCCAATCTACTACAAT
TATTAATCTGGAGGCCCAATCTTTACAGCAGAAATTGCCCACCAATG (SEQ ID
No 18);

R3: CCACTAGTTCTAGAGATATTCTTCAGAGGGCTCAGACTGCTTTTTATTAGC
AGTCTTCTTTTCAATATTTCGCTCTTAGGAGC (SEQ ID No 19)

In the first stage of construction two PCR reactions were set up with primer pairs F1/R1
and F2/R2, respectively. The products were purified and then used in a second 'overlapping'
reaction in which primers F1 and R3 were added after 10 cycles. This procedure results
in a 552bp PCR product (SEQ ID No 20):

CGAGATCCTACAGTTGGCGCCCGAACAGGGACCTGAGAGGGGCGCAGACCCTACCTGTTGAA
CCTGGCTGATCGTAGGATCCCCGGGACAGCAGAGGAGAACTTACAGAAGTCTTCTGGAGGTGT
TCCTGGCCAGAACACAGGAGGACAGGTAAGATTGGGAGACCCTTTGACATTGGAGCAAGGCG
CTCAAGAAGTTAGAGAAGGTGACGGTACAAGGGTCTCAGAAATTAACTACTGGTAACTGTAAT
TGGGCGCTAAGTCTAGTAGACTTATTTCATTGATACCAACTTTGTAAAAGAAAAGGACTGGCA
GCTGAGGGATTGTCATTCCATTGCTGGAAGATTGTAACTCAGACGCTGTCAGGACAAGAAAGA
GAGGCCTTTGAAAGAACATTGGTGGGCAATTTCTGCTGTAAAGATTGGGCCTCCAGATTAATA
ATTGTAGTAGATTGGAAAGGCATCATTCCAGCTCCTAAGAGCGAAATATTGAAAAGAAGACTG
CTAATAAAAAGCAGTCTGAGCCCTCTGAAGAATATCTCTAGAACTAGTGG

This was digested with NarI and XbaI and ligated into pONY8Z, pONY8Z MLVHyb and
pONY8Z CMVHyb, which had been prepared for ligation by digestion with the same
enzymes. These plasmids were designated pONY8ZA, pONY8ZA MLVHyb and
pONY8ZA CMVHyb. The sequence for pONY8ZA CMVHyb is provided in SEQ ID No
52. These plasmids have a unique XbaI site into which can be inserted genes such as
GFP or neomycin phosphotransferase or another biologically active entity. This use of the
site is demonstrated for GFP. The GFP ORF was obtained from pEGFP-1 (Clontech) by
digestion with SmaI and XbaI, and the ends were then filled in by treatment with T4 DNA
polymerase. This fragment was then ligated into pONY8ZA, pONY8ZA MLVHyb and
pONY8ZA CMVHyb prepared for ligation by digestion with XbaI and subsequent filling
in with T4 DNA polymerase. The resulting vector plasmids were called pONY8GZA,
pONY8GZA MLVHyb and pONY8GZA CMVHyb. Other genes can be inserted at this
site by manipulations apparent to those skilled in the art.

Creation of EIAV vector genomes containing loxP sites in their LTRs

The time taken to construct producer cell lines for EIAV vectors would be greatly
reduced if it was possible to 1) locate and 2) reutilise a site in the host cell chromosome
which was particularly favourable for high levels of transcription of the vector genome.
In outline, this can be achieved by engineering loxP sites in the 3' LTR of EIAV vectors,
transduction of the packaging cell line with vectors which carry loxP and hybrid LTRs,
selection of cells which express the highest levels of vector genome and exchange of the
test EIAV vector genome for the vector genome of choice using the cre/loxP
recombination system.


The proposed scheme was evaluated using a derivative of pONY8GZA CMVHyb in
which a loxP site was introduced into the PstI site between the EIAV sequences (required
for efficient integration) and the CMV promoter in the 3' LTR. After transduction the
integrated vector will thus have a loxP-CMV cassette located in both the 5' and 3' LTRs,
and therefore full length transcripts of the vector genome will be driven by the CMV
promoter, which is a powerful promoter. pONY8GZA CMVHyb contains many PstI sites;
hence it was modified to allow insertion of the loxP site by digestion with XbaI and NheI
and religation to create the subclone, pONY CMVHyb. This plasmid has a unique PstI
site in the hybrid LTR. The loxP site was inserted into this site using two complementary
oligonucleotides which when annealed formed PstI-compatible termini. These were
termed loxP POS PstI [GATAACTTCGTATAATGTATGCTATACGAAGTTATCTGCA] (SEQ ID No
21) and

loxP NEG PstI [GATAACTTCGTATAGCATACATTATACGAAGTTATCTGCA]
(SEQ ID No 22)

The sequence and orientation of the loxP site was confirmed by DNA sequencing and the
plasmid called pONY CMVHyb loxP. The central part of the vector genome was then
reintroduced into this subclone by transfer of the NotI-BstEII fragment from pONY8GZA
CMVHyb into pONY CMVHyb loxP cut the same way. The resulting vector was termed
pONY8GZA CMVHyb loxP.
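The design of the two loxP oligonucleotides can be checked in silico: they should pair along their length except for a 3' TGCA overhang at each end (compatible with PstI-cut DNA), and the strand on which the loxP site reads 5' to 3' defines its orientation. The following is a sketch only; the function names are hypothetical, and sequencing remains the confirmation used in the text.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


LOXP = "ATAACTTCGTATAATGTATGCTATACGAAGTTAT"  # canonical 34 bp loxP site
LOXP_POS_PSTI = "GATAACTTCGTATAATGTATGCTATACGAAGTTATCTGCA"  # SEQ ID No 21
LOXP_NEG_PSTI = "GATAACTTCGTATAGCATACATTATACGAAGTTATCTGCA"  # SEQ ID No 22


def loxp_orientation(seq):
    """Return '+' or '-' for the strand on which loxP reads 5'->3', else None."""
    if LOXP in seq:
        return "+"
    if reverse_complement(LOXP) in seq:
        return "-"
    return None


def anneals_with_3prime_overhang(top, bottom, overhang="TGCA"):
    """True if the oligos pair everywhere except a 3' overhang on each strand."""
    n = len(overhang)
    return (top.endswith(overhang) and bottom.endswith(overhang)
            and reverse_complement(top[:-n]) == bottom[:-n])


print(loxp_orientation(LOXP_POS_PSTI), loxp_orientation(LOXP_NEG_PSTI))
print(anneals_with_3prime_overhang(LOXP_POS_PSTI, LOXP_NEG_PSTI))
```

Because both ends of the annealed duplex carry the same overhang, the insert can ligate in either orientation, which is why the orientation has to be confirmed afterwards.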

Two routes for construction of a producer cell line are available using pONY8GZA
CMVHyb loxP. The plasmid can be introduced into a packaging cell line by transfection,
or vector particles can be made using 293T cells and these used to transduce the packaging
cell line. Since the vector is derived from EIAV, rather than MLV, it is able to transduce
non-dividing cells or slowly dividing cells. In this situation it has been hypothesised that
integrations occur at chromosomal sites that are constitutively open; that is, are likely to
be sites at which high levels of transcription will be maintained for extended periods.
This may be important for the long term usefulness of the producer cell line and thus
represents an advantage of the strategy using transduction.

Producer cell lines were made by transfection or transduction of a TE671-derived cell line
(EV11E) which has stably integrated copies of VSV-G and the synthetic EIAV gag/pol
under the control of CMV promoters. Prior to transfection, pONY8GZA CMVHyb
loxP was linearised by digestion with AhdI. Seven days following transfection or
transduction, cells expressing the highest levels of GFP were selected by FACS and then
cloned by limiting dilution. A number of clones were analysed for levels of full length
vector RNA using Taqman technology based assays in order to confirm the hypothesis
that the highest level of GFP expression correlates with the highest levels of vector RNA.

The cell line which expressed the highest level of RNA was then tested for production of
transducing vector particles 5 days after changing the temperature of incubation from 37°C
to 32°C. At 32°C VSV-G expression is induced; however, maximal levels of VSV-G are
only obtained after 5 days at the permissive temperature (see WO 00/52188). The cell
line producing the highest titre, EV11E CMV loxP, was selected for further work.

In order to exchange the vector genome for another EIAV vector genome with a
more suitable configuration for use in the clinical setting, EV11EloxP cells were
transfected with the Cre recombinase expression plasmid, pBS185 (Gibco), which results in
excision of the EIAV vector between the loxP sites. This leaves a loxP-CMV promoter
R-U5 sequence in the cells. Cells from which the EIAV vector genome had been excised
were selected on the basis of low levels of GFP expression by FACS and assumed to be
clonal on the basis of the clonality of EV11E CMV loxP. These were termed
EV11EloxPA and used as targets for new EIAV vector genomes.

Construction of EIAV vector genomes with downstream REV expression cassettes
and flanking loxP sites

The production of vector particles from minimal EIAV vectors (those which do not
express EIAV REV or any other EIAV proteins) is increased by about 10-fold in the
presence of EIAV REV in our 293T transient production system when the
codon-optimised EIAV gag/pol expression construct is used to drive production of vector
particles, as set out in Table 2. This may be due to improved nuclear to cytoplasmic transport of
the vector genome in the presence of REV protein. Packaging/producer cell lines for
EIAV vectors may be engineered to express REV protein. One approach would be to
engineer cells to express EIAV REV from an independently transfected expression
cassette. However, the cassette and the vector genome may be subject to differential
regulation, for example by methylation or chromosome remodelling. Such an effect may
limit the useful life of such cell lines.

Table 2. Effect of REV expression on titres obtained from REV-expressing [REV+] and
non-expressing [REV-] vectors

vector plasmid      gag/pol expression plasmid      titre (l.f.u./ml) (a)

pONY4Z [REV+]       pONY3.1                         2.0±0.4 x 10°
pONY4Z [REV+]       pESYNGP                         0.9±0.2 x 10^6
pONY8Z [REV-]       pONY3.1                         1.5±0.2 x 10 b
pONY8Z [REV-]       pESYNGP                         1.9±0.6 x 10*

(a) Titre was measured on D17 cells and is expressed as LacZ forming units/ml (l.f.u./ml).
Transfections were carried out in 293T cells using the vector plasmid and gag/pol
expression plasmid shown and pRV67 (VSV-G expression plasmid) (See WO 00/52188).


REV+ and REV- reflect the rev expression status of the vectors. REV+ reflects vectors 
which express the REV protein. REV- reflects expression vectors which do not express 
the REV protein. 

pESYNGP

The gag/pol expression plasmid shown in Figure called pESYNGP was constructed
as follows: The codon-optimised EIAV gag/pol ORF was synthesised by Operon
Technologies Inc., Alameda and supplied in a proprietary plasmid backbone, GeneOp.
The complete fragment synthesised included sequences flanking the EIAV gag/pol ORF:
tctagaGAATTCGCCACCATG- EIAV gag/pol- UGAACCCGGGgcggccgc (SEQ ID No
44). The ATG start and UGA stop codons are shown in bold. XbaI and NotI sites are in
lower case. These were used to transfer the gag/pol ORF from GeneOp into pCIneo
(Promega) using the NheI and NotI sites in the latter.


pESDSYNGP 

An alternative expression plasmid for expression of the synthetic EIAV gag/pol could
also be used. It is called pESDSYNGP and its construction is described as follows:

pESDSYNGP was made from pESYNGP by exchange of the 306bp EcoRI-NheI fragment,
from just upstream of the start codon for gag/pol to approximately 300 base pairs inside
the gag/pol ORF, with a 308bp EcoRI-NheI fragment derived by digestion of a PCR product made
using pESYNGP as template and using the following primers: SD FOR
[GGCTAGAGAATTCCAGGTAAGATGGGCGATCCCCTCACCTGG] (SEQ ID No
60) and SD REV [TTGGGTACTCCTCGCTAGGTTC] (SEQ ID No 61). This
manipulation replaces the Kozak consensus sequence upstream of the ATG in pESYNGP
with the splice donor found in EIAV. The sequence between the EcoRI site and the ATG
of gag/pol is thus CAGGTAAG (SEQ ID No 62).
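The difference between the two gag/pol constructs lies in the few bases between the EcoRI site and the start codon. A short sketch, using the SD FOR primer and the 5' flank quoted for pESYNGP, extracts that stretch; the function name is hypothetical.

```python
ECORI = "GAATTC"


def leader_between_site_and_atg(primer, site=ECORI):
    """Return the bases between the end of `site` and the first ATG after it."""
    site_start = primer.find(site)
    if site_start == -1:
        raise ValueError("restriction site not found")
    after_site = site_start + len(site)
    atg = primer.find("ATG", after_site)
    if atg == -1:
        raise ValueError("no ATG downstream of the site")
    return primer[after_site:atg]


SD_FOR = "GGCTAGAGAATTCCAGGTAAGATGGGCGATCCCCTCACCTGG"  # SEQ ID No 60
KOZAK_FLANK = "TCTAGAGAATTCGCCACCATG"  # 5' flank of pESYNGP, from SEQ ID No 44

print(leader_between_site_and_atg(SD_FOR))       # splice donor variant
print(leader_between_site_and_atg(KOZAK_FLANK))  # Kozak variant
```

The two results differ by two bases, which accounts for the 306bp fragment being replaced by one of 308bp.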

The sequences for pESYNGP (SEQ ID No 53) and pESDSYNGP (SEQ ID No 54) are
provided.

Packaging/producer cells may be engineered by physically linking the genome and EIAV
REV expression cassettes. In this way stable transfectants may be generated which
contain the vector genome and the EIAV REV expression cassette in the same chromatin
environment. This manipulation may ensure that the relative levels of transcription of the
vector genome and the REV expression cassette are maintained, leading to an increased
duration of vector production from the producer cells.

Previous work has suggested that optimisation of the level of REV may be required with
respect to the level of vector genome (see WO 98/17815). We have examined the levels
of vector production in a transient system in which several different promoters are used to
drive REV expression in order to determine which vector genome-rev expression cassette
is optimal for use in constructing producer cell lines. The highest titres were obtained
with FB29 and PGK promoters driving REV expression.


The following describes the construction of EIAV vector genome plasmids in which
there is a downstream expression cassette for synthetic EIAV REV protein. The
promoters tested were FB29, PGK, TK, CMV, SV40 and RSV. In addition, loxP sites
were engineered into the vector plasmid backbone in such a way that the genome and
introduced promoter-REV expression cassette were flanked by them. In this way, the complete
vector-REV cassette can be recombined into loxP sites in the target cell.
The complete construction of the FB29 and PGK containing plasmids is described here.
The REV expression construct was inserted in both orientations with respect to the
EIAV vector genome. Plasmids in which the FB29 or PGK promoters drive REV
expression are being utilised for construction of stable producer cell lines.


Construction of plasmids 

In the first step of construction an SfiI site was inserted downstream of the EIAV vector
sequence. This site is the insertion site for the promoter-REV cassettes. The construction
was made as follows. pONY8Z was digested with EheI and NruI, the ends were blunted
by treatment with T4 DNA polymerase and religated. The resulting plasmid, pONY8Z
delta, is thus deleted with respect to the leader, gag, reporter cassette and most of the
Rev/RRE regions.

pONY8Z delta was mutated to contain loxP sites inserted in the DraIII site immediately
5' of the CMV promoter and in the BspLU11I site 3' of the vector genome. The
loxP sites were inserted using complementary oligonucleotide pairs which when annealed had
overhanging termini suitable for cloning into these sites, and were inserted in two steps of
cloning. The oligonucleotides for insertion into the DraIII site were

VSAT 158: [GTGATAACTTCGTATAATGTATGCTATACGAAGTTATCACTAC]
(SEQ ID No 23)

and

VSAT 155 [GTGATAACTTCGTATAGCATACATTATACGAAGTTATCACGTA]
(SEQ ID No 24)

For the BspLU11I site they were:
VSAT 156 [CATGTATAACTTCGTATAATGTATGCTATACGAAGTTATA] (SEQ ID
No 25) and
VSAT 157 [CATGTATAACTTCGTATAGCATACATTATACGAAGTTATA] (SEQ ID
No 26)

Plasmids were selected in which the orientation of the loxP site at both positions was the same
and the same as that of the EIAV vector genome. The modified plasmid was called pONY8Z
delta 2xloxP.

pONY8Z delta 2xloxP has a unique PvuII site downstream of the deleted EIAV vector
genome into which annealed complementary oligonucleotides encoding SfiI sites were
inserted. The oligonucleotides were:

SFI SRF POS [AGTAGGCCGCCTCGGCCGCCCGGGCATCA] (SEQ ID No 27) and

SFI SRF NEG [TGATGCCCGGGCGGCCGAGGCGGCCTACT] (SEQ ID No 28)

Clones which had the SfiI - SrfI sites in either orientation were selected for further work.
These were called pONY8Z delta SfiI FOR and REV.

Amplification and cloning of FB29 and PGK promoters 



The FB29 promoter was amplified from pRDF (Cosset FL, et al. J Virol 1995
Dec;69(12):7430-6) using primers:

FB29 POS [TAGCCGAGATCTCAAATTGCTTAGCCTGATAGCC] (SEQ ID No
29) and

FB29 NEG [TGCGTAGCTAGCCTCCCGGTGGTGGGTCGGTG] (SEQ ID No 30)

which introduce 5' BglII and 3' NheI sites.

The PGK promoter was amplified from pPE327 using primers

PGK POS [AGCAGTAGATCTGGGGTTGGGGTTGCGCCTTT] (SEQ ID No 31) and
PGK NEG [CGTCATGCTAGCCTGGGGAGAGAGGTCGGTG] (SEQ ID No 32)
The PGK promoter sequence obtained from this plasmid was the same as the sequence of
GenBank Acc. No. M11958 except that it has a single mutation: nucleotide 347 of
M11958 is changed from G to A. The TK promoter and intron was amplified from
pRL-TK (Promega) with:

TK POS [TACGGAAGATCTAAATGAGTCTTCGGACCT] (SEQ ID No 33) and
TK NEG [CTCAACGCTAGCGTACTCTAGCCTTAAGAGCTG] (SEQ ID No 34)

The RSV promoter was amplified from pREP7 (Invitrogen) with

RSV POS [TACCAGAGATCTTCTAGAGTCGACCAATTCTCATG] (SEQ ID No 35)
and

RSV NEG [CATCGAGCTAGCAGCTTGGAGGTGCACACCAATG] (SEQ ID No 36)

The SV40 promoter was amplified from pCIneo (Promega) with:

SV40 POS [GATGGTAGATCTGCGCAGCACCATGGCCTGAA] (SEQ ID No 37) and

SV40 NEG [CTCGAAGCTAGCAGCTTTTTGCAAAAGCCTAGGC] (SEQ ID No 38)

The PCR fragments were digested with BglII and NheI and ligated into pSL1180
(Pharmacia) which had been prepared by digestion using the same enzymes. Following
transformation into E. coli, DNA was prepared and the sequence of the promoters checked
by DNA sequencing. Clones in which the correct promoter sequence was present were
used for further work and were called pSL1180-FB29, pSL1180-PGK, pSL1180-RSV,
pSL1180-SV40 and pSL1180-TK.
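The restriction sites carried by the promoter primers can be located with a simple scan, which is a convenient check before digestion with BglII and NheI. The sketch below uses the FB29 and PGK primers listed above; the dictionary layout and function name are illustrative assumptions.

```python
# Recognition sequences of the two enzymes used for cloning the promoters.
SITES = {"BglII": "AGATCT", "NheI": "GCTAGC"}

# Primer sequences as listed in the text (SEQ ID Nos 29 to 32).
PRIMERS = {
    "FB29 POS": "TAGCCGAGATCTCAAATTGCTTAGCCTGATAGCC",
    "FB29 NEG": "TGCGTAGCTAGCCTCCCGGTGGTGGGTCGGTG",
    "PGK POS": "AGCAGTAGATCTGGGGTTGGGGTTGCGCCTTT",
    "PGK NEG": "CGTCATGCTAGCCTGGGGAGAGAGGTCGGTG",
}


def sites_in(seq, sites=SITES):
    """Return {enzyme: 0-based position} for each recognition site found."""
    return {name: seq.find(motif) for name, motif in sites.items() if motif in seq}


for name, seq in PRIMERS.items():
    print(name, sites_in(seq))
```

In each primer the site sits behind a short 5' extension, which is commonly included so that the enzyme can cut efficiently close to the end of a PCR product.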

In the next step the promoter fragments were positioned to drive transcription of synthetic
EIAV REV in pE syn REV. pE syn REV is a pCIneo based plasmid (Promega) which
was made by introducing the EcoRI to SalI fragment from the synthetic EIAV REV
plasmid into the polylinker region of the plasmid using the same sites. The synthetic
EIAV REV plasmid made by Operon contains a codon-optimised EIAV REV open
reading frame flanked by EcoRI and SalI. The sequence of this fragment is shown as
SEQ ID No 39.

Prior to replacement of the CMV promoter in pE syn REV, the plasmid was modified as follows.
The SV40 neo region was deleted by digestion with KpnI and BamHI, the ends blunted
by treatment with T4 DNA polymerase and then religated. The plasmid was termed pE
syn REV delta. Next, SfiI sites were introduced into both the BglII site which is just 5' of
the CMV promoter and the DraIII site downstream of the polyA signal.

The oligonucleotides used for this were as follows:

SFI FOR BglII POS [GATCGGCCGCCTCGGCCA] (SEQ ID No 40) and
SFI FOR BglII NEG [GATCTGGCCGAGGCGGCC] (SEQ ID No 41) and
SFI FOR DRA POS [GGCCGCCTCGGCCGTA] (SEQ ID No 42) and
SFI FOR DRA NEG [GGCCGAGGCGGCCTAC] (SEQ ID No 43)

Clones in which the SfiI site was located 5' of the BglII site were selected and used for
further work. The plasmid obtained after this two step manipulation was termed pE syn
REV delta 2xSfiI. It has the following features: 5' SfiI site - BglII site - CMV promoter
and intron - NheI site - E syn REV - polyA site - 3' SfiI site. Hence the CMV promoter
can be excised by digestion with BglII and NheI and replaced with the promoter of choice
obtained from the pSL1180 series of clones by digestion with the same enzymes.
Construction details are included from this point only for the constructs which contained
the FB29 and PGK promoters; however, a similar scheme was used for the other promoters,
except that a partial SfiI digestion was required for transfer of the SV40-REV cassette.

Promoter fragments were obtained from pSL1180-FB29 and pSL1180-PGK by
digestion with BglII and NheI and ligated into pE syn REV delta 2xSfiI digested with the
same enzymes. The resulting plasmids were called FB29 E SYN REV and PGK E SYN
REV.

In the next stage the internal regions of pONY8G, pONY8.1G SIN MIN and pONY4G
were obtained by digestion with SgfI (which has a unique site in the CMV promoter driving
the EIAV vector genome) and MunI (which cuts in the 3' LTR) and ligated into pONY8Z
delta SfiI FOR and REV prepared for ligation by digestion with the same enzymes. The
resulting plasmids were called pONY8G SfiI FOR and REV, pONY8.1G SIN MIN SfiI
FOR and REV and pONY4G SfiI FOR and REV.

In the final stage the promoter-REV cassettes were moved from FB29 E SYN REV and
PGK E SYN REV into pONY8G SfiI FOR and REV, pONY8.1G SIN MIN SfiI FOR and
REV and pONY4G SfiI FOR. This manipulation was achieved as follows. FB29 E syn
REV, PGK E syn REV and the vector plasmids described immediately above were
digested with SfiI and ligations set up with the appropriate fragments. The promoter-REV
cassettes were orientated in the same or opposite orientations with respect to the EIAV
vector genome in the 'FOR' and 'REV' plasmids. The resulting plasmids were called
pONY8.3G FB29 + or - and pONY8.3G PGK + or -. A schematic structure of the
pONY8.3 +/- plasmids is shown in Figure 17.

The performance of these constructs was tested in relation to pONY8G in 293T transient 
production assays and the results are shown in Table 3. 



The sequence of the EcoRI to SalI fragment representing the codon-optimised EIAV
REV open reading frame obtained from the plasmid synthesised by Operon (SEQ
ID No 39)

EcoRI and SalI sites are in bold. The ATG start and UGA termination codons are underlined

GAATTCGCCACC ATG GCTGAGAGCAAGGAGGCCAGGGATCAAGAGATGACCTCAAGGAA
GAGAGCAAAGAGGAGAAGCGCCGCAACGACTGGTGGAAGATCGACCCAAAGGCCCCCTG
GAGGGGGACCAGTGGTGCCGCGTGCTGAGACAGTCCCTGCCCGAGGAGAAGATTCCTAGC
CAGACCTGCATCGCCAGAAGACACCTCGGCCCCGGTCCCACCCAGCACACACCCTCCAGA
AGGGATAGGTGGATTAGGGGCCAGATTTTGCAAGCCGAGGTCCTCCAAGAAAGGCTGGAA
TGGAGAATTAGGGGCGTGCAACAAGCCGCTAAAGAGCTGGGAGAGGTGAATCGCGGCATC
TGGAGGGAGCTCTACTTCCGCGAGGACCAGAGGGGCGATTTCTCCGCATGGGGAGGCTAC
CAGAGGGCACAAGAAAGGCTGTGGGGCGAGCAGAGCAGCCCCCGCGTCTTGAGGCCCGGA
GACTCCAAAAGACGCCGCAAACACCTGTGAAGTCGAC
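A printed open reading frame of this kind can be sanity-checked by translating it from the first ATG to the first in-frame stop codon. The sketch below builds the standard genetic code table and translates only the first few codons of the fragment; the function name is hypothetical, and the stop codon in the example input is added here purely so that the example terminates.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built from the conventional TCAG-ordered table.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate_orf(dna):
    """Translate from the first ATG to the first in-frame stop ('*' included)."""
    dna = dna.upper().replace("U", "T")
    start = dna.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks unreadable codons
        protein.append(aa)
        if aa == "*":
            break
    return "".join(protein)


# First codons of the fragment as printed above, followed by a stop codon
# added here purely so the example terminates.
print(translate_orf("GAATTCGCCACCATGGCTGAGAGCAAGGAGGCCTGA"))
```

Codons containing characters other than A, C, G or T are reported as X, which makes transcription errors in a typed sequence easy to spot.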



Table 3

Titres obtained from a representative experiment in which the vector-REV constructs
were tested by transient 293T production assay. The vector constructs were cotransfected
with pE synGP, the synthetic EIAV gag/pol expression plasmid, and pRV67, the VSV-G
expression plasmid. Titres were measured in D17 cells.

Plasmid                                    Titre (g.f.u./ml)

pONY8G SfiI FOR                            1.6 x 10^4
# pONY8G SfiI FOR PLUS pE syn REV          5.2 x 10^3
pONY8.3G FB29 +                            8.8 x 10^3
pONY8.3G FB29 -                            7.8 x 10 J
pONY8.3G PGK +                             1.2 x 10 b
pONY8.3G PGK -                             1.2 x 10°
pONY8G                                     9.4 x 10 ?

Titre was assessed on D17 cells and is expressed as green fluorescent protein cell units/ml
(g.f.u./ml). Transfections were carried out with pE syn GP KOZAK and pRV67 as
described previously.

* pONY8G SfiI FOR is identical to the pONY8.3 derivatives except that no
promoter-REV expression cassette is inserted in the SfiI site

# pE syn REV plasmid was also included in this transfection

pONY8G is a standard EIAV vector genome used for comparative purposes

pONY8.3G FB29 - is shown as SEQ ID No 45

pONY8.3G FB29 + is shown as SEQ ID No 46

pONY8.3G PGK - is shown as SEQ ID No 47

pONY8.3G PGK + is shown as SEQ ID No 48.

SUMMARY 

Thus, in summation, the present invention provides high titer regulated retroviral vectors.
These regulated retroviral vectors include lentivectors, HRE-regulated vectors and
functional SIN vectors which can be produced at high titres from derived producer cell
lines.

The present invention also provides a method other than retroviral transduction for the
transfer of a regulated retroviral vector into a derived producer cell line. This method
comprises a recombinase assisted method which allows for the production of high titer
regulated retroviral vectors.

In one broad aspect, the present invention relates to the selection of cells which express
high levels of a retroviral vector genome and exchange of this retroviral genome for the
vector genome of choice, preferably a regulated retroviral vector genome or a lentiviral
vector genome using a cre/loxP recombination system. Thus, the present invention
enables regulated retroviral vectors to be produced at high titres from transduced producer
cell lines.

In another broad aspect, the present invention relates to the selection of cells which
express high levels of a retroviral vector genome and exchange of this retroviral genome
for the vector genome of choice, preferably a regulated retroviral vector genome or a
lentiviral vector genome using a cre/loxP recombination system and a retroviral vector
production system which incorporates a REV protein production system. Thus, the
present invention enables regulated retroviral vectors to be produced at high titres from
transduced producer cell lines.

All publications mentioned in the above specification are herein incorporated by
reference. Various modifications and variations of the described methods and system of
the invention will be apparent to those skilled in the art without departing from the scope
and spirit of the invention. Although the invention has been described in connection with
specific preferred embodiments, it should be understood that the invention as claimed
should not be unduly limited to such specific embodiments. Indeed, various
modifications of the described modes for carrying out the invention which are obvious to
those skilled in molecular biology or related fields are intended to be within the scope of
the following claims.
SEQUENCE LISTING PART OF THE DESCRIPTION 
pONY8.1Z MLVHyb (SEQ ID NO 10) 

agatcttgaataataaaatgtgtgtttgtccgaaatacgcgttttgagatttc 
atagtggtgtttatcgccgatagagatggcgatattggaaaaattgatat^^ 

cgatgtgagtttctgtgtaactgata tcgcca'fttttccaaaagtgatttttgggcatacgcgatatctggcgatagcgc 
ttatatcgtttacgggggatggcgatagacgactttggtgacttgggcgattctgtgtgtcgcaaa ratcgcagtttcga 
tataggtgacagacgatatgaggctatatcgccgatagaggcgacatc.a,agctggcacatggccaatgcatatcgatc 

tatacattgaatcaatattggccattagccatattattcattggttatatagcataaatcaatattggctattggccatt 
gcatacgttgtatccatatcgtaatatgtacatttatattggctcatgtccaacattaccgccatgttgacattgattatt 
g act agtt att aatagt aatc aattacggggtcattagttcatagcccatatatggagttccgcgttacataacttacgg 
taaatggcccgcctggctgaccgccc/vacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgcc 
aatagggactttccattgacgtc.a-atgggtggagtatttacggtaaactgcccacttggcagtacatcaagtgtatcat 

atgccaagtccgccccctattgacgtcaatgacggtaaatggcccgcctggcattatgcccagtacatgaccttacggg 
actttcctacttggcagtacatctacgtattagtcatcgctattaccatggtgatgcggtrrtggcagtacacc.aatggg 
cgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtc.aatgggagtttgttttggcacca 

ATCAACGGGACTTTCCAAAATGTCGTAACAACTGCGATCGCCCGCCCCGTTGACGC/VAATGGGCGGTAGGCGTGTaCGG 
TGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGGGCACTCAGATTCTGCGGTCTGAGTCCCTTCTCTGCTGGGCT 

GAAAAGGCCTTTGTAATAAATATAATTCTCTACTCAGTCCCT 

CCGAACAGGGACCTGAGAGGGGCGCAGACCCTACCTGTTGAACCTGGCTGATCGTAGGATCCCCGGGACAGCAGAGGA 
GAACTTACAGAAGTCTTCTGGAGGTGTTCCTGGCCAGAACACAGGAGGACAGGTAAGATTGGGAGACCCTTTGACATT 
GGAGCAAGGCGCH'CAAGAAGTTAGAGAAGGTGACGGTACAAGGGTCTCAGAAATTAACTACTGGTAACTGTAATTGGG 
CGCTAAGTCTAGTAGACTTATTTCATGATACCAACTTTGTAAAAGAAAAGGACTGGCAGCTGAGGGATGTCATTCCATT 

GCTGGAAGATGTAACTCAGACGCTGTCAGGACAAGAAAGAGAGGCCnTTGAAAGAACATGGTGGGCAATTTCTGCTGT 
AAAGATGGGCCTCCAGATTAATAATGTAGTAGATGGAAAGGCATCATTCCAGCTCCTAAGAGCGA.AATATGAAAAGA.A 
GACTGCTAATAAAAAGCAGTCTGAGCCCTCTGAAGAATATCTCTAGAACTAGTGGATCCCCCGGGCTGCAGGAGTGGG 
GAGGCACGATGGCCGCnTTGGTCGAGGCGGATCCGGCCATTAGCCATATTATTCATTGGTTATATAGCATAAATCAATA 
TTGGCTATTGGCCATTGCATACGTTGTATCCATATCATAATATGTACATTTATATTGGCTCATGTCCAACATTACCGCCAT 

GTTGACATTGATTATrGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGC 
GTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATG 

ttcccatagtaacgccaatagggacttrccattgacgtcaatgggtggagtatttacggta.aactgcccacttggcagt 
acatcaagtgtatcatatgcca,agtacgccccctattgacgtcaatgacggta.aatggcccgcctggcattatgcccag 
tacatgaccitatgggactttcctacrrggcagtacatctacg 
gcagtacatcaatgggcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagt 
ttgttttggcaccaaaatcaacgggactitccaaaatgtcgtaacaactc^ 

atgtacggtgggaggtctatataagcagagctcgtttagtgaaccgtcagatcgcctggagacgccatccacgctgttt 
tgacctccatagaagacaccgggaccgatccagcctccgcggccccaagcttcagctgctcgaggatctgcggatccgg 
ggaattccccagtctcaggatccaccatgggggatcccgtcgrmacaacgtcgtgactgggaaaaccctggcgttac 
ccaacttaatcgccttgcagcacatcccccmcgccagctggcgtaatagcgaagaggcccgcaccgatcgcccttcc 
caacagttgcgcagcctgaatggcgaatggcgctttgccrggtttccggcaccagaagcggtgccggaaagctggctg 
gagtgcgatcrrcctgaggccgatactgtcgtcgtccccrcaaactggcagatgcacggttacgatgcgcccatctaca 
ccaacgtaacctatcccatracggtcaatccgccgtttgttcccacggagaatccgacgggttgttactcgctcacattt 

.AATGTTG ATG AAAGCTG GCT AC AGG AAG GCC AG AC GCG AATT A ITTl rGATGGCGTTAACTCGGCGTTTCATCTGTGGT 
GCAACGGGCGCTGGGTCGGrrACGGCCAGGACAGTCGTTTGCCGTCTG.AATTTGACCTGAGCGCATTTTTACGCGCCGG 
AGAAAACCGCCTCGCGGTGATGGTGCTGCGTTGGAGTGACGGCAGTTATCTGGAAGATCAGGATATGTGGCGGATGAG 
CGGCATTTTCCGTGACGTCTCGTTGCTGCaTAAACCGACTACACAAATC^ 

ATGATTTCAGCCGCGCTGTACTGGAGGCTGAAGTTCAGATGTGCGGCGAGTTGCGTGACTACCTACGGGT.AACAGTTTC 
TTTATGGCAGGGTGAAACGCAGGTCGCCAGCGGCACCGCGCCTrTCGGCGGTGAAATTATCGATGAGCGTGGTGGTTAT 
gccgatcgcgtcacactacgtctcaacgtcgaaaacccg,w\actgtggagcgccgaaatcccg.-v\tctc:tatcgtgcgg 
tggttgaactgcacaccgccgacggcacgctgattgaagcagaagcctgcgatgtcggtttccgcgaggtgcggattga 
aaatggtctgctgctgctgaacggcaagccgttc^ 

caggtcatggatgagcagacgatggtgcaggatatcctgctgatg/\agcagaacaactttaacgccgtgcgctgttcgc 
attatccgaaccatccgctgtggtacacgctgtgcgaccgctacggcctgtatgtggtggatgaagccaatattgaaac 

CCACGGCATGGTGCCA.ATGAATCGTCTGACCGATGATCCGCGCTGGCTACCGGCGATGAGCGAACGCGTAACGCG.\AT 
GGTGCAGCGCGATCGTAATCACCCGAGTGTGATCATCTGGTCGCTGGGGAATGAATCAGGCCACGGCGCTAATCACGA 
CGCGCTGTATCGCTGGATCAAATCTGTCGATCCTTCCCGCCCGGTGCAGTATGAAGGCGGCGGAGCCGACACCACGGCC 
ACCGATATTATTTGCCCGATGTACGCGCGCGTGGATGAAGACCAGCCCTTCCCGGCTGTGCCGAAATGGTCCATCAAA-A 

aatggctttcgctacctggagagacgcgcccgctgatcctttgcga^tacgcccacgcgatgggtaacagtcttggcgg 
tttcgctaa.atactggcacgcgtttcgtcagtatccccgtttacagggcggcttcgtctgggactgggtggatcagtcgc 
tgattaaatatgatgaaaacggcaacccgtggtcggcttacggcggtgarm 

ctgtatgaacggtctggtctttgccgaccgcacgccgcatccagcgctgacggaagcaaaacaccagcagcag \ i n ic 

CAGTTCCGTTTATCCGGGCAAACCATCGAAGTGACCAGCGA-ATACCTGTTCCGTCATAGCGAT/VACGAGCTCCTGCACT 
GGATGGTGGCGCTGGATGGTAAGCCGCTGGCAAGCGGTGAAGTGCCTCTGGATGTCGCTCCACAAGGTAAACAGTTGA 
TTGA.ACTGCCTG.AACTACCGCAGCCGGAGAGCGCCGGGCAACTCTGGCTCACAGTACGCGTAGTGCA.ACCGAACGCGA 
CCGCATGGTCAGAAGCCGGGCACATCAGCGCCTGGCAGCAGTGGCGTCTGGCGGAAAACCTCAGTGTGACGCTCCCCG 
CCGCGTCCCACGCCATCCCGCATCTGACCACCAGCGAAA7GGATTTTTGCATCGAGCTGGGTAATAAGCGTTGGCAATT 
TAACCGCCAGTCAGGCTTTCTTTCACAGATGTGGATTGGCGATAAAAAACAACTGCTGACGCCGCTGCGCGATCAGTTC 
ACCCGTGCACCGCTGGATAACGACATTGGCGTAAGTGAAGCGACCCGCATTGACCCTAACGCCTGGGTCGAACGCTGG 
.AAGGCGGCGGGCCATTACCAGGCCGAAGCAGCGTTGTTGCAGTGCACGGCAGATACACTTGCTGATGCGGTGCTGATT 
ACGACCGCTCACGCGTGGCAGCATCAGGGGAAAACCTTATTTATCAGCCGGAAAACCTACCGGATTGATGGTAGTGGTC 
AAATGGCG ATT ACCGTTG ATGTTG AAG TGGCG AGCG AT AC ACCGC ATCCGGC GCG G ATTGGCCTG AACTGCC AGCTG G C 
GCAGGTAGCAGAGCGGGTAAACTGGCTCGGATTAGGGCCGCAAGAAAACTATCCCGACCGCCTTACTGCCGCCTGTTrT 
GACCGCTGGGATCTGCCATTGTCAGACATGTATACCCCGTACGTCTTCCCGAGCGAAAACGGTCTGCGCTGCGGGACGC 
GCGAATTGAATTATGGCCCACACCAGTGGCGCGGCGACTTCCAGTTCAACATCAGCCGCTACAGTCAACAGCAACTGAT 

ggaaaccagccatcgccatctgctgcacgcggaagaaggcacatggctgaatatcgacggtttccatatggggattgg 
tggcgacgactcctggagcccgtcagtatcggcggaattccagctgagcgccggtcgctaccattaccagttggtctgg 
tgtcaaaaataataataaccgggcaggggggatccgcagatccggctgtggaatgtgtgtcagttagggtgtggaaag 
tccccaggctccccagcaggcagaagtatgcaaagcatgcctgcaggaattcgatatcaagcttatcgataccgtcgaa 

TTG G AAG AGCTTT AAATC CTG GC AC ATCTC ATGT ATC AATGCCTC AGT ATGTTT AG AAAAAC AAG G G G G G AACTGTGGG 

gtttrratgaggggttttataatgaaagaccccacctgtaggtttggcaagctagcttaagtaacgcc 

catggaaaaatacataactgaga.atagaga.agttcagatcaaggtcaggaacagatggaacagctgaatatgggcca 

aacaggatatctgtggtaagcagttcctgccccggctcagggccaagaacagatggaacagctgaatatgggccaaac 
aggatatctgtggtaagcagttcctgccccggctcagggccaagaacagatggtccccagatgcggtccagccctcagc 

agtttctagagaaccatcagatgtttccagggtgccccaaggaccrgaa^ 

tcagttcgcttctcgcttctgttcgcgcgcttct^ 

cactcagattctgcggtctgagtcccttctctgctgggctgaaaaggc(^^ 

ctgtcrctagtttgtctgttcgagatcctacagagct^ 
attgttatccgctcacaattccacacaacatacgagccggaagcataaagtgtaaagcctggggtgcctaatgagtgag 

ctaactcacattaattgcgttgcgctcactgcccgctttccagtcggga^ 

gccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgctc 

cggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggat^cgcaggaaag 

AACATGTGAGCaAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCG 

ccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcg 
ttrccccctggaagctccctcgtgcgctctcctgt^^ 

ggaagcgtggcgcrttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtgt 
gcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaagacacgac 
ttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttcttga-agt 
GGTGGCCTAACTACGGCTACACTAG.AACGACAGTATTTGGTATCTCCGCTCTGCTGAAGCCAGTTACCTrcGGAAA/VAG 
AGTTGCTAGCTCTTGATCCGGeAAACAAACCACCGCTGGTAGCGGTC 
AGAAAAAAAGGATCTCAAGAAGATCOTTGATCTTTTCTACGGGGTCTC^ 
GGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTrA^ 
AGTATATATGAGTA.\.ACTTGGTCTGACAGTTACCA.\TGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCG 
TTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCC\AGTGCTG_CA 
ATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAAT/WXCCAGCCAGCCGGAAGGGCCGAGCGCAGA 
AGTGGTCCTGCAACTTTATCCGCCTCCATCCaGTCTATTAA'TO 

TAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCG 
GTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCT^ 

GTCAGAAGT.AAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGT 

AAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGA^TAGTGTATGCGGCGACCGAGTTG 

CGGCGTCAATACGGGATAATACCGCGCCACaTaGCaGAACTTTAAAAGTGCTCATCAT^^ 

AAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGT^^ 
TTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAA^ 

AATGTTG AATACTCATACTCTTCCTTTTTCAA ^ 

TTGAATGTAITTAGAAAAATAAACAAATAGGGGTTCC^ 

AATATTTTGTTAAAATTCGCGTTAAA'1 I 1 T I GTTAAATC AGCTC ATTTTTT AACCAATAGGCCG AAATCGGCAAAATCCC 
ITATAAATCAAAAGaATAGACCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGAAC 
GACTCCAACGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCTAATCAAGTTTTT 
TGGGGTCGAGGTGCCGTAAAGCACT/\AATCGGAACCCT.A.AAGGGAGCCCCCGATTTAGAGCTTGACGGGG.AAAGCCAA 
CCTGGCTTATCGAAATTAATACGACTCACTATAGGGAGACCGGC 

pONY8.3G FB29 - (SEQ ID No 45) 


agatcttgaataataaaatgtgtgtttgtccgaaatacgcgttttgagatttctgtcgcc 

gactaaattcatgtcgcgcgatagtggtgtttatcgccgatagagatggcgatattggaa 

aaattgatatttgaaaatatggcatattgaaaatgtcck:cgatgtgagtttctgtgt.aac 

tgatatcgccatttttccaaaagtgatttttgggcatacgcgatatctggcgatagcgct 

tatatcgtttacgggggatggcgatagacgactttggtgacttgggcgattctgtgtgtc 

gcaaatatcgcagtttcgatataggtgacagacgatatgaggctatatcgccgatagagg 

cgacatcaagctggcacatggccaatgcatatcgatctatacattgaatcaatattggcc 

attagccatattattcattggttatatagcataaatcaatattggctattggccattgca 

tacgttgtatccatatcgtaatatgtacatttatattggctcatgtccaacattaccgcc 

atgttgacattgattattgactagttattaatagtaatcaattacggggtcattagttca 

tagcccatatatggagttccgcgttacataacttacggtaaatggcccgcctggctgacc 

gcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaat 

AGG G ACTTTCC ATTG ACGTC AATGG GTG G AGT ATTT ACGG T AAACTGCCC ACTTG GC AGT 

AC ATCAAGTGTATC AT ATGCCAAGTCCGCCCCCTATTGACGTCAATGACGGTAAATGGCC 

CGCCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTA 

CGT ATT AGTC ATC GCT ATT AC C ATGGTG ATGCG GTTTTG G C AGT AC ACC AATGGGCGTG G 

ATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTT 

GTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGT.AACAACTGCGATCGCCCGCC 

CCGTTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCGT 

TTAGTGAACCGGGCACTCAGATTCTGCGGTCTGAGTCCCTTCTCTGCTGGGCTGAAAAGG 

CCTTTGT AAT AAAT AT AATTCTCT ACTC AGTCCCTGTCTCT AGTTTGTC TGTTCG AG ATC 

CTACAGTTGGCGCCCGAACAGGGACCTGAGAGGGGCGCAGACCCTACCTGTTGAACCTGG 

CTGATCGTAGGATCCCCGGGACAGCAGAGGAGA.ACTTACAGAAGTCTTCTGGAGGTGTTC 
CTGGCCAGAACACAGGAGGACACGTAAGATTGGGAGACCCTTTGACATTGGAGCAAGGCG 

CTCAAGAAGTTAGAGAAGGTGACGGTACAAGGGTCTCAGAAATTAACTACTGGTAACTGT 

AATTGGGCGCTAAGTCTAGTAGACTTATTTCATGATACCAACTTTGTAAAAGAAAAGGAC 

TGGCAGCTGAGGGATGTCATTCCATTGCTGGAAGATGTAACTCAGACGCTGTCAGGACAA 

GA.^AGAGAGGCCTTTGAAAGAACATGGTGGGCAATTTCTGCTGT^AGATGGGCCTCCAG 

ATTAATAATGTAGTAGATGGA.\AGGCATCATTCCAGCTCCT,A.AGAGCGA.A.ATATGAAAAG 

AAGACTGCTAATA.A.AAAGCAGTCTGAGCCCTCTG.AAGAATATCTCTAGAACTAGTGGATC 

CCCCGGGCTGCAGGAGTGGGGAGGCACGATGGCCGCTTTGGTCGAGGCGGATCCGGCCAT 

TAGCCATATTATTCATTGGTTATATAGCATA.AATC.AATATTGGCTATTGGCCATTGCATA 

C GTTG T ATCC AT ATC AT AAT ATGT AC ATTT AT ATTG GCTC ATGTCC A AC ATT ACCGCC AT 

GTTGACATTGATTATTGACTAGTTATT.^ATAGT.A.\TCAATTACGGGGTCATTAGTTCATA 

GCCCATATATGGAGTTCCGCGTTACAT^CTTACGGTAAATGGCCCGCCTGGCTGACCGC 

ccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatag 
ggactttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtac 
atcaagtgtatcatatgccaagtacgccccctattgacgtcaatgacggtaaatggcccg 

CCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACG 

TATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGAT 

AGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTC.AATGGGAGTTTGT 

TTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGC 

AAATGGGCGGTAGGCATGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACC 

GTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACC 

GATCCAGCCTCCGCGGCCCCAAGCTTGTTGGGATCCACCGGTCGCCACCATGGTGAGCAA 

GGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAA 

cggccacaagttcagcgtgtccggcgagggcgagggcgatgccacctacggcaagctgac 

cctgaagttcatctgcaccaccggcaagctgcccgtgccctggcccaccctcgtgaccac 

cctgacctacggcgtgcagtgcttcagccgctaccccgaccacatgaagcagcacgactt 

cttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttcttcaaggacga 

cggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctggtgaaccgcat 

cgagctgaagggcatcgacttcaaggaggacggcaacatcctggggcacaagctggagta 

caactacaacagccacaacgtctatatcatggccgacaagcagaagaacggcatcaaggt 

gaacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgccgaccactacca 

gcagaacacccccatcggcgacggccccgtgctgctgcccgacaaccactacctgagcac 

CCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTT 
CGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGCGGCCGCGA 
CTCTAGAGTCGACCTGCAGGCATGCAAGCTTCAGCTGCTCGAGGGGGGGCCCGGTACCCA 
GCTTTTGTTCCCTTTAGTGAGGGTTAATTGCGCGGGAAGTATTTATCACTAATCAAGCAC 
AAGTAATACATGAGAAACTTTTACTACaGCAAGCACAATCCTCCAAAAAA I TTTG 1 I l l I 

acaaaatccctggtgaacatgattgg.a-agggacctactagggtgctgtggaagggtgatg 

gtgcagtagtagttaatgatgaaggaaagggaataattgctctaccattaaccaggacta 

agttactaataaaaccaaattgagtattgttgcaggaagcaagacccaactaccattgtc 

agctgtgtttcctgacctcaatatttgttataaggtttgatatgaatcccagggggaatc 

tcaacccctattacccaacagtcagaaaaatctaagtgtgaggagaacacaatgtttcaa 

ccttattgttataataatgacagtaagaacagcatggcagaatcgaaggaagcaagagac 

caagaatgaacctgaaagaagaatct.aaagaagaaaaaagaagaaatgactggtggaaaa 

taggtatgtttctgttatgcttagcaggaactactggaggaatactttggtggtatgaag 

gactcccacagcaacattatatagggttggtggcgatagggggaagattaaacggatctg 

gccaatcaaatgctatagaatgctggggttccitcccggggtgtagaccatttcaaaatt 

acttcagttatgagaccaatagaagcatgcatatggataataatactgctacattattag 

aagctttaaccaatataactgctctataaataacaaaacagaattagaaacatggaagtt 
agtaaagacttctggcat/vactcctttacctatttcttctgaagctaacactggactaat 

tagacataagagagattttggtataagtgcaatagtggcagctattgtagccgctactgc 

tattgctgctagcgctactatgtcttatgttgctctaactgaggtt.^acaaaataatgga 

agtacaaaatcatacttttgaggtagaaaatagtactctaaatggtatggatttaataga 

acgacaaataaagatattatatgctatgattcttcaaacacatgcagatgttcaactgtt 

.aaaggaaagacaacaggtagaggagacatttaattt.a.attggatgtatagaa^gaacaca 

tgtattttgtcatactggtcatccctggaatatgtcatggggacatttaaatgagtcaac 

acaatgggatgactgggtaagcaaav\tgga>vgatttaaatcaagagatactaactacact 

tcatggagccaggaacaatttggcacaatccatgataacattcaatacaccagatagtat 

agctcaatttggaaaagacctttggagtcatattggaaattggattcctggattgggagc 

ttccattataaaatatatagtgatgtttttgcttatttatttgttactaacctcttcgcc 

taagatcctcagggccctctggaaggtgaccagtggtgcagggtcctccggcagtcgtta 

cctgaagaaaaaattccatcacaaacatgcatcgcgagaagacacctgggaccaggccca 

acacaacatacacctagcaggcgtgaccggtggatcaggggacaaatactacaagcagaa 

gtactccaggaacgactggaatggagaatcagaggagtacaacaggcggccaaagagctg 

ggtgaagtcaatcgaggcatttggagagagctatatttccgagaagaccaaaggggagat 

ttctcagcctggggcggctatcaacgagcacaagaacggctctggggggaacaatcctca 

ccaagggtccttagacctggagattcgaagcgaaggaggaaacatttatgactgttgcat 

taaagcccaagaaggaactctcgctatcccttgctgtggatttcccttatggctattttg 

gggactagtaattatagtaggacgcatagcaggctatggattacgtggactcgctgttat 

aat aagg atttgt att ag agg ctt aaatttg at atttg aaat aatc ag aaaaatgcttg a 

ttatattggaagagctttaaatcctggcacatctcatgtatcaatgcctcagtatgttta 

gaaaaacaaggggggaactgtgggg i i i i i atgaggggttttataaatg att ataagagt 

aaaaagaaagttgctgatgctctcataaccttgtataacccaaaggactagctcatgttg 

ctaggcaactaaaccgcaata^ccgcatttgtgacgcgagttccccatrggtgacgcgtt 

AACTTCCTGTTTTT AC AGT ATATAAGTGCTTGTATTCTG AC AATTGGGC ACTC AG ATTCT 

GCGGTCTGAGTCCCTTCTCTGCTGGGCTGAAAAGGCCTTTGTAATAAATATAATTCTCTA 

CTCAGTCCCTGTCTCTAGTTTGTCTGTTCGAGATCCTACAGAGCTCATGCCTTGGCGTAA 

TCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATA 

CGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTA 

ATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGTGATGCCCG 

GGCGGCCGAGGCGGCCTACGTGAACCATCACCCAAATCAAG 11 Tl I I GCGGTCG AGGTGC 

CGTAAAGCTCTAAATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAG 

CCGGCGAACGTGGCGAGAAAGGAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTG 

GCAAGTGTAGCGGTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAATGCGCCGCTA 

CAGGGC GCGTCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGG 

CCTCTTCGCTATTACGCCAGCCCGGATCGATCCrTATCGGATTTTACCACATTTGTAGAG 

GTTTTACTTGCTTTAAAAAACCTCCCACATCTCCCCCTG 

GCAATTGTTGTTGTrAACTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGC 

ATC AC AAATTTC AC AAATAAAGC ATTTTTTTC ACTGC ATTCTAGTTGTGGTTTGTCC AAA 

CTCATCAATGTATCTTATCATGTCTGCTCGAAGCATTAACCCTCACTAAAGGGAAGCGGC 

CGCCCGGGTCGACrrCACAGGTGTTTGCGGCGTCTrrTGGAGTCTCCGGGCCTCAAGACG 

CGGGGGCTGCTCTGCTCGCCCCAC AGCC I'll C lTGTGCCCTCTGGTAGCCTCCCCATGCG 

GAGAAATCGCCCCTCTGGTCCTCGCGGAAGTAGAGCTCCCTCCAGATGCCGCGATTCACC 

TCrCCCAGCTCTTTAGCGGCTTGrrGCACGCCCCTA^ 

AGGACCTCGGCn"GCAAAATCTGGCCCCTAATCCACCrrATCCCn"CTGGAGGGTGTGTGC 
TGGGTGGGACCGGGGCCGAGGTGTCTTCTGGCGATGCAGGTCTGGCTAGGAATCTTCTCC 
TCGGGCAGGGACTGTCTCAGCACGCGGCACCACTGGTCCCCCTCCAGGGGGCCTTGTGGG 
TCGATCrrCCACCAGTCGTrGCGGCGCTrCTCCTCTTTGCTCTCTTCC^ 
TCTTGATCCCTCGCCTCCTTGCTCTCAGCCATGGTGGCGAATTCTCGAGGCTACCCTCCC 

GGTGGTGGGTCGGTGGTCCCTGGGCAGGGGTCTCCAGATCCCGGACGAGCCCCCAAATGA 

AAGACCCCCGAGACGGGTAGTCAATCACTCTGAGGAGACCCTCCCAACGAACAGCGAGAC 

CACGAGTCGGATGCAACAGCAAGAGGATTTATTGGATACACGGGTACCCGGGCGACTCAG 

TCTATCGGAGGACTGGCGCGCCGAGTGAGGGGTTGTGAGCTCTTTTATAGAGCTCGGGAA 

GC AG AAGCGCGCGAAC AG AAGCG AG AAGCAGGCTGATTGGTTAATTCAAATAAGGC AC AG 

GGTCATTTCAGGTCCTTGGGGGAGCCTGGAAACATCTGATGGGTCTTAAGAAACTGCTGA 

GGGTTGGGCCATATCTGGGGACCATCTGTTCTTGGCCCCGGGCCGGGGCCGAACCGCGGT 

GACCATCTGTTCTTGGCCCCGGGCCGGGGCCGAAACTGCTCACCGCAGATATCCTGTTTG 

GC C C AACGTT AGCTGTTTTCGTGT AC C C GCCCTTG ATCTG AACTTCTCT ATTCTTG GTTT 

GGTATTTTTCCATGCCTTGCAAAATGGCGTTACTGCGGCTATCAGGCTAAGCAATTTGAG 

ATCTGGCCGAGGCGGCCTACTCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGT 

TTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGG 

CTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGG 

GATAACGCAGGAAAGAACATGTATAACTTCGTATAATGTATGCTATACGAAGTTATACAT 

GTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCG I t 1 11 

CCATAGGCTCCGCCCCCCTGACGAGCATCAC.^AAAATCGACGCTCAAGTCAGAGGTGGCG 

A.AACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTC 

TCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGT 

GGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAA 

gctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaacta 

tcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaa 

caggattagcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaa 

cracggctacactagaaggacagtatttggtatctgcgctctgctgaagccagttacctt 

cggaaaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggttt 

ttttgtttgcaac^agcagattacgcck:agaaaaaaaggatctcaagaagatcct^ 

cntttctacggggtctgacgctcagtggaacgaaaactcacgttaagggattttggtcat 

gagattatcaaaaaggatcntcacctagatccttttaaattaaaaatgaagttttaaatc 

AATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGC 

ACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTA 

GATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGA 

CCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCG 

CAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGC 

TAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCAT 

CGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAG 

GCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGAT 

CGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAA 

TTCTCrTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAA 

GTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGA 

TAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGG 

GCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGC 

ACCCAACTGATCTTCAGCATCTTTTACrTTCACCAGCGTTTCTGGGTGAGCAAAAACAGG 

AAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACT 

CTTCC 11 in C AATATT ATTG AAGC ATTTATC AGGGTTATTGTCTCATGAGCGG ATAC AT 

ATTTG AATGT ATTT AG AAAAAT AAAC AAAT AGGGGTTC CGCGC AC ATTTCCCCG AAAAGT 

GCCACCTAAATTGTAAGCGTTAATATTTTGTTAAAATTCGCGTTAAATTTT^ 

AGCTCATTTTTTAACCAuATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAG 

ACCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTG 

GACTCCAACGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACGTGATAAC 
TTCGTATAATGTATGCTATACGAAGTTATCACTACOTGAACCATCACCCTAATCAAGTTT 
TTTGGGGTCGAGGTGCCGTA.AAGCACTAAATCGGAACCCT.\AAGGGAGCCCCCGATTTAG 
AGCTTGACGGGGAAAGCCA-ACCTGGCTTATCG/WXTTAATACGACTCACTATAGGGAGAC 
CGGC 

pONY8.3G FB29 + (SEQ ID No 46) 

AG ATCTTG AATAAT AAAA TG TGTGTTTGTC CG AAAT ACGCGTTTTG A G ATTTCTGTC G C C 

GACTAAATTCATGTCGCGCGATAGTGGTGTTTATCGCCGATAGAGATGGCGATATTGGAA 

A.^ATTGATATTTGAAAATATGGCATATTG/V.'VAATGTCGCCGATGTGAGTTTCTGTGT/VXC 

TG AT ATCGCC ATTTTTCC AAAAGTG ATTTTTG GGC AT ACGCG AT ATCTGGCG AT A G C G CT 

TATATCGTTTACGGGGGATGGCGATAGACGACTTTGGTGACTTGGGCGATTCTGTGTGTC 

GCAAATATCGCAGTTTCGATATAGGTGACAGACGATATGAGGCTATATCGCCGATAGAGG 

CGACATCAAGCTGGCACATGGCCAATGCATATCGATCTATACATTGAATCAATATTGGCC 

ATTAGCCATATTATTCATTGGTTATATAGCATAAATCAATATTGGCTATTGGCCATTGCA 

TACGTTGTATCCATATCGTAATATGTACATTTATATTGGCTCATGTCCAACATTACCGCC 

ATGTTGACATTGATTATrGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCA 

TAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACC 

GCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAAT 

AGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGT 

ACATCAAGTGTATCATATGCCAAGTCCGCCCCCTATTGACGTCAATGACGGTAAATGGCC 

CGCCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTA 

CGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACACCAATGGGCGTGG 

ATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCA.ATGGGAGTTT 

GTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTGCGATCGCCCGCC 

CCGTTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCGT 

TTAGTGAACCGGGCACTCAGATTCTGCGGTCTGAGTCCCTTCTCTGCTGGGCTGAAAAGG 

CCTTTGTAATAAATATAATTCTCTACTCAGTCCCTGTCTCTAGTTTGTCTGTTCGAGATC 

CTACAGTTGGCGCCCGAACAGGGACCTGAGAGGGGCGCAGACCCTACCTGTTGAACCTGG 

CTG ATCGT AGGATCCCCGGG AC AGC AG AGG AG AACTT AC AG AAGTCTTCTGG AGGTGTTC 

CTGGCCAGAACACAGGAGGACAGGTAAGATTGGGAGACCCTTTGACATTGGAGCAAGGCG 

CTCAAGAAGTTAGAGAAGGTGACGGTACAAGGGTCTCAGAAATTAACTACTGGTAACTGT 

AATTGGGCGCT AAGTCT AGT AG ACTT ATTTC ATG AT ACC AACTTTGT AAAAG AAAAGG AC 

TGCKrAGCTGAGGGATGTCATTCCATTGCTGGAAGATGTAACTCAGACGCTGTCAGGACAA 

GAAAGAGAGGCCTTTGAAAGAACATGGTGGGCAATTTCTGCTGTAAAGATGGGCCTCCAG 

ATTAATAATGTAGTAGATGGAAAGGCATCATTCCAGCTCCTAAGAGCGAAATATGAAAAG 

AAGACTGCTAATAAAA.A,GCAGTCTGAGCCCTCTGAAGAATATCTCTAGAACTAGTGGATC 

CCCCGGGCTGCAGGAGTGGGGAGGCACGATGGCCGCTTTGGTCGAGGCGGATCCGGCCAT 

TAGCCATATTATTCATTGGTTATATAGCATA.\ATCAATATTGGCTATTGGCCATTGCATA 

CGTTGTATCCATATCATAATATGTACATTTATATTGGCTCATGTCCAACATTACCGCCAT 

GTTGACATrGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATA 

GCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGC 

CCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAG 

GGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTAC 

ATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCG 

CCTGGCATrATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACG 

TATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGAT 

AGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGT 

TTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGC 
AAATGGGCGGTAGGCATGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGA/\CC 

GTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACC 

GATCCAGCCTCCGCGGCCCCAAGCTTGTTGGGATCCACCGGTCGCCACCATGGTGAGCAA 

GGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAA 

CGGCCAC.AAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGAC 

CCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCAC 

CCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTT 

cttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttcttcaaggacga 

cggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctggtgaaccgcat 

cg agctg aa gggc atcg acttc aagg ag g ac ggc aac atc ctggggc ac aagctg g agt a 

caactacaacagccacaacgtctatatcatggccgacaagcagaagaacggcatcaaggt 

gaacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgccgaccactacca 

gcagaacacccccatcggcgacggccccgtgctgctgcccgacaaccactacctgagcac 

ccagtccgccctgagcaaagaccccaacgagaagcgcgatcacatggtcctgctggagtt 

cgtgaccgccgccgggatcactctcggcatggacgagctgtacaagtaaagcggccccga 

ctctagagtcgacctgcaggcatgcaagcttcagctgctcgagggggggcccggtaccca 

gcttttgttccctttagtgagggttaattgcgcggcaagtat^ 

aagtaatacatgagaaacttttactacagcaagcacaatcctccaaaaaat^^ 

acaaaatccctggtgaacatgattggaagggacctactagggtgctgtggaagggtgatg 

gtgcagtagtagttaatgatgaaggaaaggg.'\ataattgctgtaccattaaccaggacta 

agttactaataaaaccaaattgagtattgttgcagga.agcaagaccca.actaccattgtc 

agctgtgtttcctgacctcaatatttgttataaggtttgatatgaatcccagggggaatc 

tcaacccctattacccaacagtcagaaaaatctaagtgtgaggagaacacaatgtttcaa 

ccttattgttataataatgacagta-agaacagcatggcagaatcgaaggaagcaagagac 

caagaatgaacctgaaagaagaatctaaagaagaaaaaagaagaaatgactggtggaaaa 

taggtatgtttctgttatgcttagcaggaactactggaggaatactttggtggtatgaag 

gactcccacagcaacattatatagggttggtggcgatagggggaagattaaacggatctg 

gccaatcaaatgcratagaatgctggggttccttcccggggtgtagaccatttcaaaatt 

acttcagttatgagaccaatagaagcatgcatatggataataatactgctacattattag 

aagctttaaccaatataactgctctataaataacaaaacagaattagaaacatggaagtt 

agtaaagacttctggcataactcctttaccta rue i ictgaagctaacactggactaat 

tagacataagagagattttggtataagtgcaatagtggcagctattgtagccgctactgc 

tattgctgctagcgctactatgtcrtatgttgctctaactgaggttaacaaaataatgga 

agtacaaaatcatacttttgaggtagaaaatagtactctaaatggtatggatttaataga 

ac g ac aaat aaag at att at atgct atg attcttc aaac ac atgc ag atgttc aactgtt 

aaaggaaagacaacaggtagaggagacatttaatttaattggatgtatagaaagaacaca 

tgtattttgtcatactggtcatccctggaatatgtcatggggacatttaaatgagtcaac 

acaatgggatgactgggtaagcaaaatggaagatttaaatcaagagatactaactacact 

tcatggagccaggaacaatttggcacaatccatgat.\acattcaatacaccagatagtat 

agctcaatttggaaaagacctttggagtcatattggaaattggattcctggattgggagc 

ttc c att at aa aat atatagtg atgtttttg ctt attt atttg tt act aac ctcttc gcc 

taagatcctcagggccctctggaaggtgaccagtggtgcagggtcctccggcagtcgtta 

cctgaagaaaaaattccatcacaaacatgcatcgcgagaagacacctgggaccaggccca 

acacaacatacacctagcaggcgtgaccggtggatcaggggacaaatactacaagcagaa 

gtactccaggaacgactggaatggagaatcagaggagtacaacaggcggccaaagagctg 

ggtgaagtcaatcgaggcatttggagagagctatatttccgagaagaccaaaggggagat 

ttctcagcctggggcggctatcaacgagcacaagaacggctctggggggaacaatcctca 

ccaagggtccttagaccrggagattcgaagcgaaggaggaaacattratgactgttgcat 

taaagcccaagaaggaactctcgctatcccitgctgtgga^ 
gggactagtaattatagtaggacgcatagcaggctatggattacgtggactcgctgttat 

aataaggatttgtattagaggctt.aaa'ntgatatttgaaataatcagaaaaatgcttga 

ttat attg g aag agcttt aa atcctg gc ac atctc atgtatc aatgcctc agtatgttta 

gaaaaacaaggggggaactgtgggg 1 1 ! j 1 atgaggggttttataaatgattataagagt 

.^aaaagaaagttgctgatgctctcataaccttgtataacccaaaggactagctcatgttg 

ctaggcaactaaaccgcaataaccgcatttgtgacgcgagttccccattggtgacgcgtt 

aacttcctgtttttacagtatataagtgcttgtattctgacaattgggcactcagattct 

gcggtctgagtcccttctctgctgggctgaaaaggcctttgtaataaatataattctcta 

ctcagtccctgtctctagtttgtctgttcgagatcctacagagctcatgccttggcgtaa 

tcatggtcatagctgtttcctgtgtgaaattgttatccgctcacaattccacacaacata 

cgagccggaagcataaagtgtaaagcctggggtgcctaatgagtgagctaactcacatta 

attgcgttgcgctcactgcccgctttccagtcgggaaacctgtcgtgccagagtaggccg 

cctcggccagatctcaaattgcttagcctgatagccgcagtaacgccattttgcaaggca 

tggaaaaataccaaaccaagaatagagaagttcagatcaagggcgggtacacgaaaacag 

ct aac gttgg gcc aaac agg at atctgcg gtg agc agtttcggcc ccggcccg gggcc aa 

gaacagatggtcaccgcggttcggccccggcccggggccaagaacagatggtccccagat 

atggcccaaccctcagcagtttctt^gacccatcagatgtttccaggctcccccaagga 

cctgaaatgaccctgtgccttatttgaattaaccaatcagcctgcttctcgcttctgttc 

gcgcgcttctgcttcccgagctctataaaagagctcacaacccctcactcggcgcgccag 

tcctccgatagactgagtcgcccgggtacccgtgtatccaataaatcctcttgctgttgc 

atccgactcgtggtctcgctgttccttgggagggtctcctcagagtgattgactacccgt 

ctcgggggtctttcatttgggggctcgtccgggatctggagacccctgcccagggaccac 

cgacccaccaccgggaggctagcctcgagaattcgccaccatggctgagagcaaggaggc 

cagggatcaagagatgaacctcaaggaagagagcaaagaggagaagcgccgcaacgactg 

gtggaagatcgacccacaaggccccctggagggggaccagtggtgccgcgtgctgagaca 

gtccctgcccgaggagaagattcctagccagacctgcatcgccagaagacacctcggccc 

cggtcccacccagcacacaccctccagaagggataggtggattaggggccagattttgca 

agccgaggtcctccaagaaaggctggaatggagaattaggggcgtgcaacaagccgctaa 

agagctgggagaggtgaatcgcggcatctggagggagctctacttccgcgaggaccagag 

GGGCGATTTCTCCGCATGGGGAGGCTACCAGAGGGCACAAGAAAGGCTGTGGGGCGAGCA 

GAGCAGCCCCCGCGTCTTGAGGCCCGGAGACTCCAAAAGACGCCGCAAACACCTGTGAAG 

TCGACCCGGGCGGCCGCTTCCCTTTAGTGAGGGTTAATGCTTCGAGCAGACATGATAAGA 

TACATTGATGAGTTTGGACAAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGT 

G AAATTTGTG ATGCT ATTGCTTTATTTGT AACC ATT AT AAGCTGC AATAAAC AAGTT AAC 

AACAACAATTGCATTCATTTTATGTTTCAGGTTCAGGGGGAGATGTGGGAGGTTTTTTAA 

AGCAAGTAAAACCTCTACAAATGTGGTAAAATCCGATAAGGATCGATCCGGGCTGGCGTA 

ATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAAT 

GGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGAC 

CGCTACACnTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCnTCC^^ 

CACGTrCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATT 

TAGAGCTTTACGGCACCTCGACCGCAAuAAAACTTGATTTGGGTGATGGTTCACGTAGGCC 

GCCTCGGCCGCCCGGGCATCACTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGT 

TTGCGTATTGGGCGCTCTTCCGCrrCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGG 

CTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGG 

G AT AACGC AGG AAAG AAC ATGT AT AACTTCGT AT AATGT ATGCTAT ACG AAGTT AT AC AT 

GTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTT 

CCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCG 

AAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTC 

TCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCITTCTCCCTTCGGGAAGCGT 
GGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAA 

GCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTA 

TCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAA 

C AG G ATT AGC AG AGCG AGGT ATGT AGGCGGTGCTACAG AGTTCTTG AAGTG GTGGCCT AA 

CTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTT 

CGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAAC.AAACCACCGCTGGTAGCGGTGGTTT 

TmGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGAT 

CTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCAT 

GAG ATT ATC AAAAAG G ATCTTC ACCT AG ATCCTTTT AAATT AAAAATG AAGTTTT AAATC 

AATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGC 

ACCT ATCTC AGC G ATCTGTCT ATTTC GTTC ATCC AT AGTTG CCTG ACTC C CCGTCGTGT A 

GATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGA 

CCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCG 

CAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGC 

TAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCAT 

CGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAG 

gcgagttacatgatcccccatgttgtgcaaaaaagcggttagctccttcgctcctccgat 
cgttgtcagaagtaagttggccgcagtgttatcactcatggttatggcagcactgcataa 

TTCTCTT ACTGTC ATGCC ATC CGT AAG ATGCTTTTCTGTG ACTG GTG AG T ACTC AACC AA 

GTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGA 

TAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGG 

GCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGC 

ACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGG 

AAGGCAAAATCKTCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACT 

CTTCC im T'C AATATT ATTG AAGC ATTT ATC AGGGTT ATTGTCTC ATG AGCGG AT AC AT 

ArrTGAATGTATTTAGAAAAATAAACAAATA<^GGTTCCGCGCACATTTCCCCGAAAAGT 

GCCACCrAAATTGTAAGCGTTAATAriTTGTTAAAATTC 

AGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAG 
ACCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTG 

g actcc aacgtc aaagggcg aaaaaccgtct atc ag ggcg atggccc actacgtg ataac 
ttcgtataatgtatgctatacgaagttatcactacgtgaaccatcaccctaatcaagttt 
tttggggtcgaggtgccgtaaagcactaaatcggaacccraaagggagcccccgatttag 
agcttgacggggaaagccaacctggcttatcgaaattaatacgactcactatagggagac 

CGGC 

pONY8.3GPGK - (SEQ ID No 47) 

agatcitgaataataaaatgtgtgtttgtccgaaatacgcgttt^ 

gactaaattcatgtcgcgcgatagtggtgtttatcgccgatagagatggcgatattggaa 

a.\attgatatttgaaaatatggcatattgaaaatgtcgccgatgtgagtttctgtgtaac 

tgatatcgccatttttccaaaagtgatttttgggcatacgcgatatctggcgatagcgct 

tatatcgtttacgggggatggcgatagacgactttggtgacttgggcgattctgtgtgtc 

gcaaatatcgcagtttcgatataggtgacagacgatatgaggctatatcgccgatagagg 

cgacatcaagctggcacatggccaatgcatatcgatctatacattgaatcaatattggcc 

attagccatattattcattggttatatagcataaatcaatattggctattggccattgca 

tacgttgtatccatatcgtaatatgtacatttatattggctcatgtccaacattaccgcc

atgttgacattgattattgactagttattaatagtaatcaattacggggtcattagttca 

tagcccatatatggagttccgcgttacataacttacggtaaatggcccgcctggctgacc 

gcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaat



agggactttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagt

acatcaagtgtatcatatgccaagtccgccccctattgacgtcaatgacggtaaatggcc 

cgcctggcattatgcccagtacatgaccttacgggactttcctacttggcagtacatcta 

cgtattagtcatcgctattaccatggtgatgcggttttggcagtacaccaatgggcgtgg 

atagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagttt 

gttttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactgcgatcgcccgcc 

ccgttgacgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagctcgt 

ttagtgaaccgggcactcagattctgcggtctgagtcccttctctgctgggctgaaaagg

cctttgtaataaatataattctctactcagtccctgtctctagtttgtctgttcgagatc

ctacagttggcgcccgaacagggacctgagaggggcgcagaccctacctgttgaacctgg 

ctgatcgtaggatccccgggacagcagaggagaacttacagaagtcttctggaggtgttc 

ctggccagaacacaggaggacaggtaagattgggagaccctttgacattggagcaaggcg 

ctcaagaagttagagaaggtgacggtacaagggtctcagaaattaactactggtaactgt

aattgggcgctaagtctagtagacttatttcatgataccaactttgtaaaagaaaaggac

tggcagctgagggatgtcattccattgctggaagatgtaactcagacgctgtcaggacaa

gaaagagaggcctttgaaagaacatggtgggcaatttctgctgtaaagatgggcctccag 

attaataatgtagtagatggaaaggcatcattccagctcctaagagcgaaatatgaaaag

aagactgctaataaaaagcagtctgagccctctgaagaatatctctagaactagtggatc 

ccccgggctgcaggagtggggaggcacgatggccgctttggtcgaggcggatccggccat 

tagccatattattcattggttatatagcataaatcaatattggctattggccattgcata 

cgttgtatccatatcataatatgtacatttatattggctcatgtccaacattaccgccat 

gttgacattgattattgactagttattaatagtaatcaattacggggtcattagttcata

gcccatatatggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgc 

ccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatag 

ggactttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtac 

atcaagtgtatcatatgccaagtacgccccctattgacgtcaatgacggtaaatggcccg 

cctggcattatgcccagtacatgaccttatgggactttcctacttggcagtacatctacg

tattagtcatcgctattaccatggtgatgcggttttggcagtacatcaatgggcgtggat

agcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgt

tttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattgacgc

aaatgggcggtaggcatgtacggtgggaggtctatataagcagagctcgtttagtgaacc 

gtcagatcgcctggagacgccatccacgctgttttgacctccatagaagacaccgggacc 

gatccagcctccgcggccccaagcttgttgggatccaccggtcgccaccatggtgagcaa 

gggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggacggcgacgtaaa 

cggccacaagttcagcgtgtccggcgagggcgagggcgatgccacctacggcaagctgac 

cctgaagttcatctgcaccaccggcaagctgcccgtgccctggcccaccctcgtgaccac 

cctgacctacggcgtgcagtgcttcagccgctaccccgaccacatgaagcagcacgactt 

cttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttcttcaaggacga 

cggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctggtgaaccgcat

cgagctgaagggcatcgacttcaaggaggacggcaacatcctggggcacaagctggagta 

caactacaacagccacaacgtctatatcatggccgacaagcagaagaacggcatcaaggt

gaacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgccgaccactacca 

gcagaacacccccatcggcgacggccccgtgctgctgcccgacaaccactacctgagcac 

CCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTT 

CGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGCGGCCGCGA 

CTCTAGAGTCGACCTGCAGGCATGCAAGCTTCAGCTGCTCGAGGGGGGGCCCGGTACCCA 

GCTTTTGTTCCCTTTAGTGAGGGTTAATTGCGCGGGAAGTATTTATCACTAATCAAGCAC

AAGTAATACATGAGAAACTTTTACTACAGCAAGCACAATCCTCCAAAAAATTTTGTTT^

ACAAAATCCCTGGTGAACATGATTGGAAGGGACCTACTAGGGTGCTGTGGAAGGGTGATG
GTGCACTAGTAGTTAATGATGAAGGAAAGGGAATAATTGCTGTACCATTAACCACGACTA
AGTTACTAATAAAACCAAATTGAGTATTGTTGCAGGAAGCAAGACCCAACTACCATTGTC 
AGCTGTGTTTCCTGACCTCAATATTTGTTATAAGGTTTGATATGAATCCCAGGGGGAATC 

tcaacccctattacccaacagtcagaaaaatctaagtgtgaggagaacacaatgtttcaa 

ccttattgttataataatgacagtaagaacagcatggcagaatcgaaggaagcaagagac 

caagaatgaacctgaaagaagaatctaaagaagaaaaaagaagaaatgactggtggaaaa 

TAGGTATGTTTCTGTTATGCTTAGCAGGAACTACTGGAGGAATACTTTGGTGGTATGAAG 

GACTCCCACAGCAACATTATATAGGGTTGGTGGCGATAGGGGGAAGATTAAACGGATCTG 

GCCAATCAAATGCTATAGAATGCTGGGGTTCCTTCCCGGGGTGTAGACCATTTCAAAATT

ACTTCAGTTATGAGACCAATAGAAGCATGCATATGGATAATAATACTGCTACATTATTAG 

AAGCTTTAACCAATATAACTGCTCTATAAATAACAAAACAGAATTAGAAACATGGAAGTT 

AGTAAAGACTTCTGGCATAACTCCTTTACCTATTTCTTCTGAAGCTAACACTGGACTAAT

TAGACATAAGAGAGATTTTGGTATAAGTGCAATAGTGGCAGCTATTGTAGCCGCTACTGC 

TATTGCTGCTAGCGCTACTATGTCTTATGTTGCTCTAACTGAGGTTAACAAAATAATGGA

AGTACAAAATCATACTTTTGAGGTAGAAAATAGTACTCTAAATGGTATGGATTTAATAGA 

ACGACAAATAAAGATATTATATGCTATGATTCTTCAAACACATGCAGATGTTCAACTGTT

AAAGGAAAGACAACAGGTAGAGGAGACATTTAATTTAATTGGATGTATAGAAAGAACACA 

TGTATTTTGTCATACTGGTCATCCCTGGAATATGTCATGGGGACATTTAAATGAGTCAAC 

ACAATGGGATGACTGGGTAAGCAAAATGGAAGATTTAAATCAAGAGATACTAACTACACT 

TCATGGAGCCAGGAACAATTTGGCACAATCCATGATAACATTCAATACACCAGATAGTAT 

AGCTCAATTTGGAAAAGACCTTTGGAGTCATATTGGAAATTGGATTCCTGGATTGGGAGC

TTCCATTATAAAATATATAGTGATGTTTTTGCTTATTTATTTGTTACTAACCTCTTCGCC 

TAAGATCCTCAGGGCCCTCTGGAAGGTGACCAGTGGTGCAGGGTCCTCCGGCAGTCGTTA 

CCTGAAGAAAAAATTCCATCACAAACATGCATCGCGAGAAGACACCTGGGACCAGGCCCA 

ACACAACATACACCTAGCAGGCGTGACCGGTGGATCAGGGGACAAATACTACAAGCAGAA 

GTACTCCAGGAACGACTGGAATGGAGAATCAGAGGAGTACAACAGGCGGCCAAAGAGCTG 

GGTGAAGTCAATCGAGGCATTTGGAGAGAGCTATATTTCCGAGAAGACCAAAGGGGAGAT 

TTCTCAGCCTGGGGCGGCTATCAACGAGCACAAGAACGGCTCTGGGGGGAACAATCCTCA 

CCAAGGGTCCTTAGACCTGGAGATTCGAAGCGAAGGAGGAAACATTTATGACTGTTGCAT 

TAAAGCCCAAGAAGGAACTCTCGCTATCCCTTGCTGTCGATTTCCCTTATC
GGGACTAGTAATTATAGTAGGACGCATAGCAGGCTATGGATTACGTGGACTCGCTGTTAT 

aataaggatttgtattagaggcttaaatttgatatttgaaataatcagaaaaatgcttga 

TTATATTGGAAGAGCTTTAAATCCTGGCACATCTCATGTATCAATGCCTCAGTATGTTTA

GAAAAACAAGGGGGGAACTGTGGGGTTTTTATGAGGGGTTTTATAAATGATTATAAG

AAAAAGAAAGTTGCTGATGCTCTCATAACCTTGTATAACCCAAAGGACTAGCTCATGTTG

CTAGGCAACTAAACCGCAATAACCGCATTTGTGACGCGAGTTCCCCATTGGTGACGCGTT 

AACTTCCTGTTTTTACAGTATATAAGTGCTTGTATTCTGACAATTGGGCACTCAGATTCT

GCGGTCTGAGTCCCTTCTCTGCTGGGCTGAAAAGGCCTTTGTAATAAATATAATTCTCTA

CTCAGTCCCTGTCTCTAGTTTGTCTGTTCGAGATCCTACAGAGCTCATGCCTTGGCGTAA 

TCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATA 

CGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTA 

ATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGTGATGCCCG 

GGCGGCCGAGGCGGCCTACGTGAACCATCACCCAAATCAAGTTTTTTGCGGTCGAGGTGC

CGTAAAGCTCTAAATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAG

CCGGCGAACGTGGCGAGAAAGGAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTG 

GCAAGTGTAGCGGTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAATGCGCCGCTA 

CAGGGCGCGTCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGG 

CCTCTTCGCTATTACGCCAGCCCGGATCGATCCTTATCGGATTTTACCACATTTGTAGAG

GTTTTACTTGCTTTAAAAAACCTCCCACATCTCCCCCTGAACCTGAAACATAAAATGAAT



GCAATTGTTGTTGTTAACTTGTTTATTGCAGCTTA^ 

ATCACAAATTTCACAAATAAAGCATTTTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAA

CTCATCAATGTATCTTATCATGTCTGCTCGAAGCATTAACCCTCACTAAAGGGAAGCGGC 

CGCCCGGGTCGACTTCACAGGTGTTTGCGGCGTCTTTTGGAGTCTCCGGGCCTCAAGACG 

CGGGGGCTGCTCTGCTCGCCCCACAGCCTTTCTTGTGCCCTCTGGTAGCCTCCCCATGCG 

GAGAAATCGCCCCTCTGGTCCTCGCGGAAGTAGAGCTCCCTCCAGATGCCGCGATTCACC 

TCTCCCAGCTCTTTAGCGGCTTGTTGCACGCCCCTAATTCTCCATTCCAGCCTTTCTTGG

aggacctcggcttgcaaaatctggcccctaatccacctatcccttctggagggtgtctgc 

tgggtgggaccggggccgaggtgtcttctggcgatgcaggtctggctaggaatcttctcc 

tcgggcagggactgtctcagcacgcggcaccactggtccccctccagggggccttgtggg 

tcgatcttccaccagtcgttgcggcgcttctcctctttgctctcttccttgaggttcatc 

tcttgatccctggcctccttgctctcagccatggtggcgaattctcgaggctagcctggg 

gagagaggtcggtgattcggtcaacgagggagccgactgccgacgtgcgctccggaggct 

tgcagaatgcggaacaccgcgcgggcaggaacagggcccacactaccgccccacaccccg 

cctcccgcaccgccccttcccggccgctgctctcggcgcgccccgctgagcagccgctat 

tggccacagcccatcgcggtcggcgcgctgccattgctccctggcgctgtccgtctgcga 

gggtactagtgagacgtgcggcttccgtttgtcacgtccggcacgccgcgaaccgcaagg 

aaccttcccgacttaggggcggagcaggaagcgtcgccggggggcccacaagggtagcgg 

cgaagatccgggtgacgctgcgaacggacgtgaagaatgtgcgagacccagggtcggcgc 

cgctgcgtttcccggaaccacgcccagagcagccgcgtccctgcgcaaacccagggctgc 

cttggaaaaggcgcaaccccaaccccagatctggccgaggcggcctactctgcattaatg 

aatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgct 

cactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggc 

ggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgtataacttcgt 

ataatgtatgctatacgaagttatacatgtgagcaaaaggccagcaaaaggccaggaacc

gtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcaca 

aaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgt 

ttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacc 

tgtccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatc

tcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagc 

ccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaagacacgact

tatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtg 

ctacagagttcttgaagtggtggcctaactacggctacactagaaggacagtatttggta 

tctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccggca 

aacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaa

aaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacg

aaaactcacgttaagggattttggtcatgagattatcaaaaaggatcttcacctagatcc 

ttttaaattaaaaatgaagttttaaatcaatctaaagtatatatgagtaaac^ 

acagttaccaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcat

ccatagttgcctgactccccgtcgtgtagataactacgatacgggagggcttaccatctg 

gccccagtgctgcaatgataccgcgagacccacgctcaccggctccagatttatcagcaa

taaaccagccagccggaagggccgagcgcagaagtggtcctgcaactttatccgcctcca 

tccagtctattaattgttgccgggaagctagagtaagtagttcgccagttaatagtttgc 

gcaacgttgttgccattgctacaggcatcgtggtgtcacgctcgtcgtttggtatggctt 

cattcagctccggttcccaacgatcaaggcgagttacatgatcccccatgttgtgcaaaa 

aagcggttagctccttcggtcctccgatcgttgtcagaagtaagttggccgcagtgttat 

cactcatggttatggcagcactgcataattctcttactgtcatgccatccgtaagatgct 

tttctgtgactggtgagtactcaaccaagtcattctgagaatagtgtatgcggcgaccga 

gttgctcttgcccggcgtcaatacgggataataccgcgccacatagcagaactttaaaag 
TGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAACGATCTTACCGCTGTTGA

GATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCA

CCAGCGTTTCTGGGTGAGCAAAAACAGGAAGCCAAAATGCCGCAAAAAAGGGAATAAGGG 

CGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATC

AGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAG 

GGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTAAATTGTAAGCGTTAATATTTTGTT 

AAAATTCGCGTTAAATTTTTGTTAAATCAGCTC

CAAAATCCCTTATAAATCAAAAGAATAGACCGAGATAGGGTTGAGTGTTGTTCCAGTTTG

GAACAAGAGTCCACTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAAAACCGTCTA 

TCAGGGCGATGGCCCACTACGTGATAACTTCGTATAATGTATGCTATACGAAGTTATCAC 

TACGTGAACCATCACCCTAATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTAAATC

GGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAGCCAACCTGGCTTATCG 

AAATTAATACGACTCACTATAGGGAGACCGGC 
pONY8.3G PGK + (SEQ ID No 48)

AGATCTTGAATAATAAAATGTGTGTTTCTCCGAAATACGCGTTTTGAGATTTCTGTCGCC

GACTAAATTCATGTCGCGCGATAGTGGTGTTTATCGCCGATAGAGATGGCGATATTGGAA

AAATTGATATTTGAAAATATGGCATATTGAAAATGTCGCCGATCTGAGTTTCTGTGTAAC

TGATATCGCCATTTTTCCAAAAGTGATTTTTGGGCATACGCGATATCTGGCGATAGCGCT 

TATATCGTTTACGGGGGATGGCGATAGACGACTTTGGTGACTTGGGCGATTCTGTGTGTC 

GCAAATATCGCAGTTTCGATATAGGTGACAGACGATATGAGGCTATATCGCCGATAGAGG 

CGACATCAAGCTGGCACATGGCCAATGCATATCGATCTATACATTGAATCAATATTGGCC

ATTAGCCATATTATTCATTGGTTATATAGCATAAATCAATATTGGCTATTGGCCATTGCA

TACGTTGTATCCATATCGTAATATGTACATTTATATTGGCTCATGTCCAACATTACCGCC 

ATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCA 

tagcccatatatggagttccgcgttacataacttacggtaaatggcccgcctggctgacc 

gcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaat 

agggactttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagt 

acatcaagtgtatcatatgccaagtccgccccctattgacgtcaatgacggtaaatggcc

cgcctggcattatgcccagtacatgaccttacgggactttcctacttggcagtacatcta 

cgtattagtcatcgctattaccatggtgatgcggttttggcagtacaccaatgggcgtgg 

atagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagttt

gttttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactgcgatcgcccgcc

ccgttgacgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagctcgt 

ttagtgaaccgggcactcagattctgcggtctgagtcccttctctgctgggctgaaaagg

CCTTTGTAATAAATATAATTCTCTACTCAGTCCCTGTCTCTAGTTTGTCTGTTCGAGATC
CTACAGTTGGCGCCCGAACAGGGACCTGAGAGGGGCGCAGACCCTACCTGTTGAACCTGG 
CTGATCGTAGGATCCCCGGGACAGCAGAGGAGAACTTACAGAAGTCTTCTGGAGGTGTTC 
CTGGCCaGAACACAGGAGGACAGGTAAGATTGGGaGACCCTTTGACATTGGAGCAAGGCG 

ctcaagaagttagagaaggtgacggtacaagggtctcagaaattaactactggtaactgt

aattgggcgctaagtctagtagacttatttcatgataccaactttgtaaaagaaaaggac

tggcagctgagggatgtcattccattgctggaagatgtaactcagacgctgtcaggacaa

gaaagagaggcctttgaaagaacatggtgggcaatttctgctgtaaagatgggcctccag

attaataatgtagtagatggaaaggcatcattccagctcctaagagcgaaatatgaaaag

aagactgctaataaaaagcagtctgagccctctgaagaatatctctagaactagtggatc 

ccccgggctgcaggagtggggaggcacgatggccgctttggtcgaggcggatccggccat 

tagccatattattcattggttatatagcataaatcaatattggctattggccattgcata 

cgttgtatccatatcataatatgtacatttatattggctcatgtccaacattaccgccat 

gttgacattgattattgactagttattaatagtaatcaattacggggtcattagttcata

gcccatatatggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgc 

ccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatag 

ggactttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcAGTAC

atcaagtgtatcatatgccaagtacgccccctattgacgtcaatgacggtaaatggcccg 

cctggcattatgcccagtacatgaccttatgggactttcctacttggcagtacatct

tattagtcatcgctattaccatggtgatgcggttttggcagtacatcaatgggcgtggat 

agcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgt 

tttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactccgccccATTGACgc

aaatgggcggtaggcatgtacggtgggaggtctatataagcagagctcgtttagtgaacc 

gtcagatcgcctggagacgccatccacgctgttttgacctccatagaagacaccgggacc 

gatccagcctccgcggccccaagcttgttgggatccaccggtcgccaccatggtgagcaa 

gggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggacggcgacgtaaa 

cggccacaagttcagcgtgtccggcgagggcgagggcgatgccacctacggcaagctgac 
CCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCAC

CCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTT

CTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGA

CGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCAT

CGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTA 

CAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGT 

GAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCA

GCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCAC 

CCACTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTT 

CGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGCGGCCGCGA 

CTCTAGAGTCGACCTGCACGCATGCAAGCTTCAGCTGCTCGAGGGGGGGCCCGGTACCCA 

GCTTTTGTTCCCTTTAGTGAGGGTTAATTGCGCGGGAAGTATTTATCACTAATCAAGCAC

AAGTAATACATGAGAAACTTTTACTACAGCAAGCACAATCCTCCAAAAAATTTTGTTT^

ACAAAATCCCTGGTGAACATGATTGGAAGGGACCTACTAGGGTGCTGTGGAAGGGTGATG 

GTGCAGTAGTAGTTAATGATGAAGGAAAGGGAATAATTGCTGTACCATTAACCAGGACTA

AGTTACTAATAAAACCAAATTGAGTATTGTTGCAGGAAGCAAGACCCAACTACCATTGTC 

AGCTGTGTTTCCTGACCTCAATATTTGTTATAAGGTTTGATATGAATCCCAGGGGGAATC 

TCAACCCCTATTACCCAACAGTCAGAAAAATCTAAGTGTGAGGAGAACACAATGTTTCAA

CCTTATTGTTATAATAATGACAGTAAGAACAGCATGGCAGAATCGAAGGAAGCAAGAGAC 

CAAGAATGAACCTGAAAGAAGAATCTAAAGAAGAAAAAAGAAGAAATGACTGGTGGAAAA 

TAGGTATGTTTCTGTTATGCTTAGCAGGAACTACTGGAGGAATACTTTGGTGGTATGAAG 

GACTCCCACAGCAACATTATATAGGGTTGGTGGCGATAGGGGGAAGATTAAACGGATCTG 

GCCAATCAAATGCTATAGAATGCTGGGGTTCCTTCCCGGGGTGTAGACCATTTCAAAATT 

ACTTCAGTTATGAGACCAATAGAAGCATGCATATGGATAATAATACTGCTACATTATTAG

AAGCTTTAACCAATATAACTGCTCTATAAATAACAAAACAGAATTAGAAACATGGAAGTT 

AGTAAAGACTTCTGGCATAACTCCTTTACCTATTTCTTCTGAAGCTAACACTGGACTAAT 

TAGACATAAGAGAGATTTTGGTATAAGTGCAATAGTGGCAGCTATTGTAGCCGCTACTGC 

TATTGCTGCTAGCGCTACTATGTCTTATGTTGCTCTAACTGAGGTTAACAAAATAATGGA 

AGTACAAAATCATACTTTTGAGGTAGAAAATAGTACTCTAAATGGTATGGATTTAATAGA

ACGACAAATAAAGATATTATATGCTATGATTCTTCAAACACATGCAGATGTTCAACTGTT 

AAAGGAAAGACAACAGGTAGAGGAGACATTTAATTTAATTGGATGTATAGAAAGAACACA

TGTATTTTGTCATACTGGTCATCCCTGGAATATGTCATGGGGACATTTAAATGAGTCAAC

ACAATGGGATGACTGGGTAAGCAAAATGGAAGATTTAAATCAAGAGATACTAACTACACT 

TCATGGAGCCAGGAACAATTTGGCACAATCCATGATAACATTCAATACACCAGATAGTAT

AGCTCAATTTGGAAAAGACCTTTGGAGTCATATTGGAAATTGGATTCCTGGATTGGGAGC

TTCCATTATAAAATATATAGTGATGTTTTTGCTTATTTATTTGTTACTAACCTCTTCGCC 

TAAGATCCTCAGGGCCCTCTGGAAGGTGACCAGTGGTGCAGGGTCCTCCGGCAGTCGTTA 

CCTGAAGAAAAAATTCCATCACAAACATGCATCGCGAGAAGACACCTGGGACCAGGCCCA

ACACAACATACACCTAGCAGGCGTGACCGGTGGATCAGGGGACAAATACTACAAGCAGAA

GTACTCCAGGAACGACTGGAATGGAGAATCAGAGGAGTACAACAGGCGGCCAAAGAGCTG 

GGTGAAGTCAATCGAGGCATTTGGAGAGAGCTATATTTCCGAGAAGACCAAAGGGGAGAT 

TTCTCAGCCTGGGGCGGCTATCAACGAGCACAAGAACGGCTCTGGGGGGAACAATCCTCA 

CCAAGGGTCCTTAGACCTGGAGATTCGAAGCGAAGGAGGAAACATTTATGACTGTTGCAT

TAAAGCCCAAGAAGGAACTCTCGCTATCCCTTGCTC^

GGGACTAGTAATTATAGTAGGACGCATAGCAGGCTATGGATTACGTGGACTCGCTGTTAT 
AATAAGGATTTGTATTAGAGGCTTAAATTTGATATTTGAAATAATCAGAAAAATGCTTGA
TTATATTGGAAGAGCTTTAAATCCTGGCACATCTCATGTATCAATGCCTCAGTATGTTTA 
GAAAAACAAGGGGGGAACTGTGGGGTTTTTATGAGGGGTTTTATAAATGATTATAAGAGT 
AAAAAGAAAGTTGCTGATGCTCTCATAACCTTGTATAACCCAAAGGACTAGCTCATGTTG
CTAGGCAACTAAACCGCAATAACCGCATTTGTGACGCGAGTTCCCCATTCGTGACGCGTT

AACTTCCTGTTTTTACAGTATATAAGTGCTTGTATTCTGACAATTGGGCACTCAGATTCT 

GCGGTCTGAGTCCCTTCTCTGCTGGGCTGAAAAGGCCTTTGTAATAAATATAATTCTCTA 

CTCAGTCCCTGTCTCTAGTTTGTCTGTTCGAGATCCTACAGAGCTCATGCCTTGGCGTAA 

TCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATA

CGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTA

ATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGAGTAGGCCG 

CCTCGGCCAGATCTGGGGTTGGGGTTGCGCCTTTTCCAAGGCAGCCCTGGGTTTGCGCAG 

ggacgcggctgctctgggcgtggttccgggaaacgcagcggcgccgaccctgggtctcgc 

acattcttcacgtccgttcgcagcgtcacccggatcttcgccgctacccttgtgggcccc 

ccggcgacgcttcctgctccgcccctaagtcgggaaggttccttgcggttcgcggcgtgc 

cggacgtgacaaacggaagccgcacgtctcactagtaccctcgcagacggacagcgccag 

ggagcaatggcagcgcgccgaccgcgatgggctgtggccaatagcggctgctcagcgggg 

cgcgccgagagcagcggccgggaaggggcggtgcgggaggcggggtgtggggcggtagtg 

tgggccctgttcctgcccgcgcggtgttccgcattctgcaagcctccggagcgcacgtcg 

gcagtcggctccctcgttgaccgaatcaccgacctctctccccaggctagcctcgagaat

tcgccaccatggctgagagcaaggaggccagggatcaagagatgaacctcaaggaagaga

gcaaagaggagaagcgccgcaacgactggtggaagatcgacccacaaggccccctggagg 

gggaccagtggtgccgcgtgctgagacagtccctgcccgaggagaagattcctagccaga 

cctgcatcgccagaagacacctcggccccggtcccacccagcacacaccctccagaaggg 

ataggtggattaggggccagattttgcaagccgaggtcctccaagaaaggctggaatgga 

gaattaggggcgtgcaacaagccgctaaagagctgggagaggtgaatcgcggcatctgga 

gggagctctacttccgcgaggaccagaggggcgatttctccgcatggggaggctaccaga 

gggcacaagaaaggctgtggggcgagcagagcagcccccgcgtcttgaggcccggagact 

ccaaaagacgccgcaaacacctgtgaagtcgacccgggcggccgcttccctttagtgagg 

gttaatgcttcgagcagacatgataagatacattgatgagtttggacaaaccacaactag 

aatgcagtgaaaaaaatgctttatttgtgaaatttgtc 

cattataagctgcaataaacaagttaacaacaacaattgcattcattttatgtttcaggt

tcagggggagatgtgggaggttttttaaagcaagtaaaacctctacaaatgtggtaaaat 

ccgataaggatcgatccgggctggcgtaatagcgaagaggcccgcaccgatcgcccttcc 

caacagttgcgcagcctgaatggcgaatggacgcgccctgtagcggcgcattaagcgcgg 

cgggtgtggtggttacgcgcagcgtgaccgctacacttgccagcgccctagcgcccgctc 

ctttcgctttcttcccttcctttctcgccacgttcgccggctttc

atcgggggctccctttagggttccgatttagagctttacggcacctcgaccgcaaaaaac 

ttgatttgggtgatggttcacgtaggccgcctccgccgcccgggcatcactgcattaatg 

aatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgct 

cactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggc 

ggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgtataacttcgt

ataatgtatgctatacgaagttatacatgtgagcaaaaggccagcaaaaggccaggaacc 

GTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACA 

AAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGT 

TTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACC 

TGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATC

TCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGC 

CCGACCGCTGCGCCTTATCCCGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACT 

TATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTG 

CTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTA 

TCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCA

AACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAA 
AAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACG

AAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCC

TTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATG^^ 

ACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCAT 

CCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTG 

GCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAA

TAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCA 

TCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGC 

GCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTT 

CATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAA

AAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTAT

CACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCT 

TTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGA 

GTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAG

TGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGA 

GATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCA 

CCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGG 

CGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATC

AGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAG 

GGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTAAATTGTAAGCGTTAATATTTTGTT

AAAATTCGCGTTAAATTTTTGTTAAATCAGCT

CAAAATCCCTTATAAATCAAAAGAATAGACCGAGATAGGGTTGAGTGTTGTTCCAGTTTG 
GAACAAGAGTCCACTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAAAACCGTCTA 
TCAGGGCGATGGCCCACTACGTGATAACTTCGTATAATGTATGCTATACGAAGTTATCAC 
TACGTGAACCATCACCCTAATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTAAATC
GGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAGCCAACCTGGCTTATCG

AAATTAATACGACTCACTATAGGGAGACCGGC
SEQ ID No 51



pONY3.2IREShyg 

agatctcccgatcccctatggtcgactctcagtacaatctgctctgatgccgcatagtta 
agccagtatctgctccctgcttgtgtgttggaggtcgctgagtagtgcgcgagcaaaatt 
taagctacaacaaggcaaggcttgaccgacaattgcatgaagaatctgcttagggttagg
cgttttgcgctgcttcgcgatgtacgggccagatatacgcgttgacattgattattgact 
agttattaatagtaatcaattacggggtcattagttcatagcccatatatggagttccgc 
gttacataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcccattg
acgtcaataatgacgtatgttcccatagtaacgccaatagggactttccattgacgtcaa 

tgggtggactatttacggtaaactgcccacttggcagtacatcaagtgtatcatatgcca
agtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattatgcccagtac 
atgaccttatgggactttcctacttggcagtacatctacgtattagtcatcgctattacc 
atggtgatgcggttttggcagtacaccaatgggcgtggatagcggtttgactcacgggga 
tttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaaatcaacgg

gactttccaaaatgtcgtaacaactgcgatcgcccgccccgttgacgcaaatgggcggta
ggcgtgtacggtgggaggtctatataagcagagctcgtttagtgaaccgtcagatcacta 
gaagctttattgcggtagtttatcacagttaaattgctaacgcagtcagtgcttctgaca 
caacagtctcgaacttaagctgcagtgactctcttaaggtagccttgcagaagttggtcg
tgaggcactgggcaggtaagtatcaaggttacaagacaggtttaaggagaccaatagaaa 

ctgggcttgtcgagacagagaagactcttgcgtttctgataggcacctattggtcttact
gacatccactttgcctttctctccacaggtgtccactcccagttcaattacagctcttaa 
ggctagagtacttaatacgactcactataggctagcctcgaggtcgacggtatcgcccga
acagggacctgagaggggcgcagaccctacctgttgaacctggctgatcgtaggatcccc 
gggacagcagaggagaacttacagaagtcttctggaggtgttcctggccagaacacagga 

ggacaggtaagatgggagaccctttgacatggagcaaggcgctcaagaagttagagaagg
tgacggtacaagggtctcagaaattaactactggtaactgtaattgggcgctaagtctag 
tagacttatttcatgataccaactttgtaaaagaaaaggactggcagctgagggatgtca 
ttccattgctggaagatgtaactcagacgctgtcaggacaagaaagagaggcctttgaaa
gaacatggtgggcaatttctgctgtaaagatgggcctccagattaataatgtagtagatg 

gaaaggcatcattccagctcctaagagcgaaatatgaaaagaagactgctaataaaaagc
agtctgagccctctgaagaatatccaatcatgatagatggggctggaaacagaaatttta 
gacctctaacacctagaggatatactacttgggtgaataccatacagacaaatggtctat 
taaatgaagctagtcaaaacttatttgggatattatcagtagactgtacttctgaagaaa
tgaatgcatttttggatgtggtacctggccaggcaggacaaaagcagatattacttgatg 

caattgataagatagcagatgattgggataatagacatccattaccgaatgctccactgg
tggcaccaccacaagggcctattcccatgacagcaaggtttattagaggtttaggagtac 
ctagagaaagacagatggagcctgcttttgatcagtttaggcagacatatagacaatgga 
taatagaagccatgtcagaaggcatcaaagtgatgattggaaaacctaaagctcaaaata 
ttaggcaaggagctaaggaaccttacccagaatttgtagacagactattatcccaaataa

aaagtgagggacatccacaagagatttcaaaattcttgactgatacactgactattcaga
acgcaaatgaggaatgtagaaatgctatgagacatttaagaccagaggatacattagaag 
agaaaatgtatgcttgcagagacattggaactacaaaacaaaagatgatgttattggcaa 
aagcacttcagactggtcttgcgggcccatttaaaggtggagccttgaaaggagggccac
taaaggcagcacaaacatgttataactgtgggaagccaggacatttatctagtcaatgta 

gagcacctaaagtctgttttaaatgtaaacagcctggacatttctcaaagcaatgcagaa
gtgttccaaaaaacgggaagcaaggggctcaagggaggccccagaaacaaactttcccga
tacaacagaagagtcagcacaacaaatctgttgtacaagagactcctcagactcaaaatc 
tgtacccagatctgagcgaaataaaaaaggaatacaatgtcaaggagaaggatcaagtag 
aggatctcaacctggacagtttgtgggagtaacatataatctagagaaaaggcctactac 

aatagtattaattaatgatactcccttaaatgtactgttagacacaggagcagatacttc
agtgttgactactgcacattataataggttaaaatatagagggagaaaatatcaagggac 
gggaataataggagtgggaggaaatgtggaaacattttctacgcctgtgactataaagaa
aaagggtagacacattaagacaagaatgctagtggcagatattccagtgactattttggg 
acgagatattcttcaggacttaggtgcaaaattggttttggcacagctctccaaggaaat

aaaatttagaaaaatagagttaaaagagggcacaatggggccaaaaattcctcaatggcc
actcactaaggagaaactagaaggggccaaagagatagtccaaagactattgtcagaggg 
aaaaatatcagaagctagtgacaataatccntataattcacccatatttgtaataaaaaa 
gaggtctggcaaatggaggttattacaagatctgagagaattaaacaaaacagtacaagt 
aggaacggaaatatccagaggattgcctcacccgggaggattaattaaatgtaaacacat 

gactgtattagatattggagatgcatatttcactatacccttagatccagagtttagacc

ATATACAGCTTTCACTATTCCCTCCATTAATCATCAAGAACCAGATAAAAGATATGTGTG

gaaatgtttaccacaaggattcgtgttgagcccatatatatatcagaaaacattacagga 
aattttacaaccttttagggaaagatatcctgaagtacaattgtatcaatatatggatga
TTTGTTCATGGGAAGTAATGGTTCTAAAAAACAACACAAAGAGTTAATCATAGAATTAAG
GGCGATCTTACTGGAAAAGGGTTTTGAGACACCAGATGATAAATTACAAGAAGTGCCACC
TTATAGCTGGCTAGGTTATCAACTTTGTCCTGAAAATTGGAAAGTACAAAAAATGCAATT 
AGACATGGTAAAGAATCCAACCCTTAATGATGTGCAAAAATTAATGGGGAATATAACATG 
GATGAGCTCAGGGATCCCAGGGTTGACAGTAAAACACATTGCAGCTACTACTAAGGGATG
TTTAGAGTTGAATCAAAAAGTAATTTGGACGGAAGAGGCACAAAAAGAGTTAGAAGAAAA
TAATGaGAAGATTAAAAATGCTCAAGGGTTACAATATTATAATCCAGAAGAAGAAATGTT
ATGTGAGGTTGAAATTACAAAAAATTATGAGGCAACTTATGTTATAAAACAATCACAAGG
AATCCTATGGGCAGGTAAAAAGATTATGAAGGCTAATAAGGGATGGTCAACAGTAAAAAA

TTTAATGTTATTGTTGCAACATGTGGCAACAGAAAGTATTACTAGAGTAGGAAAATGTCC
AACGTTTAAGGTACCATTTACCAAAGAGCAAGTAATGTGGGAAATGCAAAAAGGATGGTA 
TTATTCTTGGCTCCCAGAAATACTATATACACATCAAGTAGTTCATGATGATTGGAGAAT
GAAATTGGTAGAAGAACCTACATCAGGAATAACAATATACACTGATGGGGGAAAACAAAA
TGGAGAAGGAATAGCAGCTTATGTGaCCAGTAATGGGAGAACTAAACAGAAAAGGTTAGG 

ACCTGTCACTCATCAAGTTGCTGAAAGAATGGCAATaCAAATGGCATTAGAGGATACCAG
AGATAAACAAGTAAATATAGTAACTGATAGTTaTTATTGTTGGAAAAATATTACAGAAGG
ATTAGGTTTAGAAGGACCACAAAGTCCTTGGTGGCCTATAATACAAAATATACGAGAAAA 

agagatagtttattttgcttgggtacctggtcacaaagggatatatggtaatcaattggc 
agatgaagccgcaaaaataaaagaagaaatcatgctagcataccaaggcacacaaattaa

agagaaaagagatgaagatgcagggtttgacttatgtgttccttatgacatcatgatacc
tgtatctgacacaaaaatcatacccacagatgtaaaaattcaagttcctcctaatagctt 
tggatgggtcactgggaaatcatcaatggcaaaacaggggttattaattaatggaggaat 
aattgatgaaggatatacaggagaaatacaagtgatatgtactaatattggaaaaagtaa 
tattaaattaatagagggacaaaaatttgcacaattaattatactacagcatcactcaaa

ttccagacagccttgggatgaaaataaaatatctcagagaggggataaaggatttggaag
tacaggagtattctgggtagaaaatattcaggaagcacaagatgaacatgagaattggca 

TACATCACCAAAGATATTGGCAAGAAATTATAAGATACCATTGACTGTAGCAAAACAGAT

aactcaagaatgtcctcattgcactaagcaaggatcaggacctgcaggttgtgtcatgag 
atctcctaatcattggcaggcagattgcacacatttggacaataagataatattgacttt 

tgtagagtcaaattcaggatacatacatgctacattattgtcaaaagaaaatgcattatg
tacttcattggctattttagaatgggcaagattgttttcaccaaagtccttacacacaga 
taacggcactaattttgtggcagaaccagttgtaaatttgttgaagttcctaaagatagc 
acataccacaggaataccatatcatccagaaagtcagggtattgtagaaagggcaaatag 
gaccttgaaagagaagattcaaagtcatagagacaacactcaaacactggaggcagcttt

acaacttgctctcattacttgtaacaaagggagggaaagtatgggaggacagacaccatg
ggaagtatttatcactaatcaagcacaagtaatacatgagaaacttttactacagcaagc
acaatcctccaaaaaattttgtttttacaaaatccctggtgaacatgattggaagggacc
tactagggtgctgtggaagggtgatggtgcagtagtagttaatgatgaaggaaagggaat
aattgctgtaccattaaccaggactaagttactaataaaaccaaattgagtattgttgca

ggaagcaagacccaactaccattgtcagctgtgtttcctgacctcaatatttgttataag
gtttgatatgaatcccagggggaatctcaacccctattacccaacagtcagaaaaatcta 
agtgtgaggagaacacaatgtttcaaccttattgttataataatgacagtaagaacagca 
tggcagaatcgaaggaagcaagagaccaagaaatgaacctgaaagaagaatctaaagaag 
aaaaaagaagaaatgactggtggaaaataggtatgtttctgttatgcttagcAGgaacta

ctggaggaatactttggtggtatgaaggactcccacagcaacattatatagggttggtgg
cgatagggggaagattaaacggatctggccaatcaaatgctatagaatgctggggttcct 
tcccggggtgtagaccatttcaaaattacttcagttatgagaccaatagaagcatgcata 
tggataataatactgctacattattagaagctttaaccaatataactgctctataaataa 
caaaacagaattagaaacatggaagttagtaaagacttctggcataactcctttacctat 

ttcttctgaagctaacactggactaattagacataagagagattttggtataagtgcaat
agtggcagctattgtagccgctactgctattgctgctagcgctactatgtcttatgttgc
tctaactgaggttaacaaaataatggaagtacaaaatcatacntttgaggtagaaaatag 
tactctaaatggtatggatttaatagaacgacaaataaagatattatatgctatgattct 
tcaaacacatgcagatgttcaactgttaaaggaaagacaacaggtagaggagacatttaa 

tttaattggatgtatagaaagaacacatgtattttgtcatactggtcatccctggaatat
gtcatggggacatttaaatgagtcaacacaatgggatgactgggtaagcaaaatggaaga 
tttaaatcaagagatactaactacacttcatggagccaggaacaatttggcacaatccat 
gataacattcaatacaccagatagtatagctcaatttggaaaagacctttggagtcatat 
tggaaattggattcctggattgggagcttccattataaaatatatagtgatgtttttgct

tatttatttgttactaacctcttcgcctaagatcctcagggccctctggaaggtgaccag
tggtgcagggtcctccggcagtcgttacctgaagaaaaaattccatcacaaacatgcatc 
gcgagaagacacctgggaccaggcccaacacaacatacacctagcaggcgtgaccggtgg 

ATCAGGGGACAAATACTACAAGCAGAAGTACTCCAGGAACGACTGGAATGGAGAATCAGA
GGaGTACAACAGGCGGCCAAAGAGCTGGGTGAAGTCAATCGAGGCATTTGGAGAGAGCTA 

TATTTCCGAGAAGACCAAAGGGGAGATTTCTCAGCCTGGGGCGGCTATCAACGAGCACAA
GAACGGCTCTGGGGGGAACAATCCTCACCAAGGGTCCTTAGACCTGGAGATTCGAAGCGA 
AGGAGGAAACATTTATGACTGTTGCATTAAAGCCCAAGAAGGAACTCTCGCTATCCCTTG
CTGTGGATTTCCCTTATGGCTATTTTGGGGGTCGACCCGGGCGGCCGCACTAGAGGAATT
CGCCCCTCTCCCTCCCCCCCCCCTAACGTTACTGGCCGAAGCCGCTTGGAATAAGGCCGG

TGTGTGTTTGTCTATATGTGATTTTCCACCATATTGCCGTCTTTTGGCAATGTGAGGGCC
CGGAAACCTGGCCCTGTCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAA

ggaatgcaaggtctgttgaatgtcgtgaaggaagcagttcctctggaagcttcttgaaga
caaacaacgtctgtagcgaccctttgcaggcagcggaaccccccacctggcgacaggtgc 
ctctgcggccaaaagccacgtgtataagatacacctgcaaaggcggcacaaccccagtgc 
CACGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTAGTCAAC
AAGGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGAATCTGATCTGGGGCCTCG
GTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAAAAAGCTCTAGGCCCCCCGAACCAC 
GGGGACGTGGTTTTCCTTTGAAAAACACGATGATAAGCTTGCCACAACCCCGTACCAAAG 
ATGGATAGATCCGGAAAGCCTGAACTCACCGCGACGTCTGTCGAGAAGTTTCTGATCGAA
AAGTTCGACAGCGTCTCCGACCTGATGCAGCTCTCGGAGGGCGAAGAATCTCGTGCTTTC 
AGCTTCGATGTAGGAGGGCGTGGATATGTCCTGCGGGTAAATAGCTGCGCCGATGGTTTC 
TACAAAGATCGTTATGTTTATCGGCACTTTGCATCGGCCGCGCTCCCGATTCCGGAAGTG 
CTTGACATTGGGGAATTCAGCGAGAGCCTGACCTATTGCATCTCCCGCCGTGCACAGGGT 
GTCACGTTGCAAGACCTGCCTGAAACCGAACTGCCCGCTGTTCTGCAGCCGGTCGCGGAG
GCCATGGATGCGATCGCTGCGGCCGATCTTAGCCAGACGAGCGGGTTCGGCCCATTCGGA 

ccgcaaggaatcggtcaatacactacatggcgtgatttcatatgcgcgattgctgatccc 
catgtgtatcactggcaaactgtgatggacgacaccgtcagtgcgtccctcgcgcaggct 
ctcgatgagctgatgctttgggccgaggactgccccgaagtccggcacctcgtgcacgcg 

gatttcggctccaacaatgtcctgacggacaatggccgcataacagcggtcattgactgg
agcgaggcgatgttcggggattcccaatacgaggtcgccaacatcttcttctggaggccg 
tggttggcttgtatggagcagcagacgcgctacttcgagcggaggcatccggagcttgca 
ggatcgccgcggctccgggcgtatatgctccgcattggtcttgaccaactctatcagagc 
ttggttgacggcaatttcgatgatgcagcttgggcgcagggtcgatgcgacgcaatcgtc 

cgatccggagccgggactgtcgggcgtacacaaatcgcccgcagaagcgcggccgtctgg
accgatggctgtgtagaagtactcgccgatagtggaaaccgacgccccagcactcgtccg 
agggcaaaggaatagagtagatgccgaccgaacaagagctgatttcgagaacgcctcagc 
cagcaactcgcgcgagcctagcaaggcaaatgcgagagaacggccttacgcttggtggca 
cagttctcgtccacagttcgctaagctcgctcggctgggtcgcgggagggccggtcgcag 

25 tgattcaggcccttctggattgtgttggtccccagggcacgattgtcatgcccacgcact 
cgggtgatctgactgatcccgcagattggagatcgccgcccgtgcctgccgattgggtgc 
agatctagagctcgctgatcagcctcgactgtgcctctagttgccagccatctgttgttt 
gcccctcccccgtgccttccttgaccctggaaggtgccactcccactgtcctttcctaat 
aaaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctggggggtgggg 

30 tggggcaggacagcaagggggaggattgggaagacaatagcaggcatgctggggatgcgg 
tgggctctatggcttctgaggcggaaagaaccagctggggctcgagtgcattctagttgt 
ggtttgtccaaactcatcaatgtatcttatcatgtctgtataccgtcgacctctagctag 
agcttggcgtaatcatggtcatagctgtttcctgtgtgaaattgttatccgctcacaatt 
ccacacaacatacgagccggaagcataaagtgtaaagcctggggtgcctaatgagtgagc 

35 taactcacattaattgcgttgcgctcactgcccgctttccagtcgggaaacctgtcgtgc
cagctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctct 
tccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggtatca 
gctcactcaaaggcggtaatacggttatccacagaatcaggggataacgcaggaaagaac 
atgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctggcgttt 

40 ttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtgg 
cgaaacccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgc 
tctcctgttccgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaagc 
gtggcgctttctcaatgctcacgctgtaggtatctcagttcggtgtaggtcgttcgctcc 
aagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaac 

45 tatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggt 
aacaggattagcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggcct 
aactacggctacactagaaggacagtatttggtatctgcgctctgctgaagccagttacc 
ttcggaaaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggt 
ttttttgtttgcaagcagcagattacgcck:agaaaaaaaggatctcaagaagatc 

50 atcttttctacggggtctgacgctcagtggaacgaaaactcacgttaagggattttggtc
atgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaa 
tcaatctaaagtatatatgagtaaacttggtctgacagttaccaatgcttaatcagtgag 
gcacctatctcagcgatctgtctatttcgttcatccatagttgcctgactccccgtcgtg 
tagataactacgatacgggagggcttaccatctggccccagtgctgcaatgataccgcga 

55 gacccacgctcaccggctccagatttatcagcaataaaccagccagccggaagggccgag 
cgcagaagtggtcctgcaactttatccgcctccatccagtctattaattgttgccgggaa 
gctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgccattgctacaggc 
atcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatca 
aggcgagttacatgatcccccatgttgtgcaaaaaagcggttagctccttcggtcctccg 

60 atcgttgtcagaagtaagttggccgcagtgttatcactcatggttatggcagcactgcat 
aattctcttactgtcatgccatccgtaagatgcttttctgtgactggtgagtactcaacc 
aagtcattctgagaatagtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgg 
gataataccgcgccacatagcagaactttaaaagtgctcatcattggaaaacgttcttcg 
gggcgaaaactctcaaggatcttaccgctgttgagatccagttcgatgtaacccactcgt 

65 gcacccaactgatcttcagcatcttttactttcaccagcgtttctgggtgagcaaaaaca
ggaaggcaaaatgccgcaaaaaagggaataagggcgacacggaaatgttgaatactcata 
ctcttcctttttcaatattattgaagcatttatcagggttattgtctcatgagcggatac 
atatttgaatgtatttagaaaaataaacaaataggggttccgcgcacatttccccgaaaa 
gtgccacctgacgtcgacggatcggg 

70 

pONY8ZA CMVHyb (SEQ ID No 52) 

AGATCTTGAATAATAAAATGTGTGTTTGTCCGAAATACGCGTTTTGAGATTTCTGTCGCC 
GACTAAATTCATGTCGCGCGATAGTGGTGTTTATCGCCGATAGAGATGGCGATATTGGAA
AAATTGATATTTGAAAATATGGCATATTGAAAATGTCGCCGATGTGAGTTTCTGTGTAAC
TGATATCGCCATTTTTCCAAAAGTGATTTTTGGGCATACGCGATATCTGGCGATAGCGCT
TATATCGTTTACGGGGGATGGCGATAGACGACTTTGGTGACTTGGGCGATTCTGTGTGTC
5 GCAAATATCGCAGTTTCGATATAGGTGACAGACGATATGAGGCTATATCGCCGATAGAGG 
CGACATCAAGCTGGCACATGGCCAATGCATATCGATCTATACATTGAATCAATATTGGCC 
ATTAGCCATATTATTCATTGGTTATATAGCATAAATCAATATTGGCTATTGGCCATTGCA
TACGTTGTATCCATATCGTAATATGTACATTTATATTGGCTCATGTCCAACATTACCGCC 
ATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCA 
10 TAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACC 
GCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAAT 
AGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGT
ACATCAAGTGTATCATATGCCAAGTCCGCCCCCTATTGACGTCAATGACGGTAAATGGCC 
CGCCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTA 
15 CGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACACCAATGGGCGTGG 
ATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTT 
GTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTGCGATCGCCCGCC 
CCGTTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCGT 
TTAGTGAACCGGGCACTCAGATTCTGCGGTCTGAGTCCCTTCTCTGCTGGGCTGAAAAGG 
20 CCTTTGTAATAAATATAATTCTCTACTCAGTCCCTGTCTCTAGTTTGTCTGTTCGAGATC

CTACAGTTGGCGCCCGAACAGGGACCTGAGAGGGGCGCAGACCCTACCTGTTGAACCTGG
CTGATCGTAGGATCCCCGGGACAGCAGAGGAGAACTTACAGAAGTCTTCTGGAGGTGTTC 
CTGGCCAGAACACAGGAGGACAGGTAAGATTGGGAGACCCTTTGACATTGGAGCAAGGCG 
CTCAAGAAGTTAGAGAAGGTGACGGTACAAGGGTCTCAGAAATTAACTACTGGTAACTGT 
25 AATTGGGCGCTAAGTCTAGTAGACTTATTTCATTGATACCAACTTTGTAAAAGAAAAGGA
CTGGCAGCTGAGGGATTGTCATTCCATTGCTGGAAGATTGTAACTCAGACGCTGTCAGGA 
CAAGAAAGAGAGGCCTTTGAAAGAACATTGGTGGGCAATTTCTGCTGTAAAGATTGGGCC
TCCAGATTAATAATTGTAGTAGATTGGAAAGGCATCATTCCAGCTCCTAAGAGCGAAATA 
TTGAAAAGAAGACTGCTAATAAAAAGCAGTCTGAGCCCTCTGAAGAATATCTCTAGAACT
30 AGTGGATCCCCCGGGCTGCAGGAGTGGGGAGGCACGATGGCCGCTTTGGTCGAGGCGGAT 
CCGGCCATTAGCCATATTATTCATTGGTTATATAGCATAAATCAATATTGGCTATTGGCC 
ATTGCATACGTTGTATCCATATCATAATATGTACATTTATATTGGCTCATGTCCAACATT 
ACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATT 
AGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGG
35 CTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAAC 
GCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTT
GGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAA 
ATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTA 
CATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGG 
40 GCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGG 
GAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCC 
ATTGACGCAAATGGGCGGTAGGCATGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTT 
AGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACA 
CCGGGACCGATCCAGCCTCCGCGGCCCCAAGCTTCAGCTGCTCGAGGATCTGCGGATCCG
45 GGGAATTCCCCAGTCTCAGGATCCACCATGGGGGATCCCGTCGTTTTACAACGTCGTGAC 
TGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGC
TGGCGTAATAGCGAAGAGGCCCCTCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAAT 
GGCGAATGGCGCTTTGCCTGGTTTCCGGCACCAGAAGCGGTGCCGGAAAGCTGGCTGGAG 
TGCGATCTTCCTGAGGCCGATACTGTCGTCGTCCCCTCAAACTGGCAGATGCACGGTTAC 
50 GATGCGCCCATCTACACCAACGTAACCTATCCCATTACGGTCAATCCGCCGTTTGTTCCC 
ACGGAGAATCCGACGCKjTTGTTACTCGCTCACATTTAATGTTGATGAAAGCTGGCTACAG 
GAAGGCCAGACGCGAATTATTTTTGATGGCGTTAACTCGGCGTTTCATCTGTGGTGCAAC 
GGGCGCTGGGTCGGTTACGGCCAGGACAGTCGTTTGCCGTCTGAATTTGACCTGAGCGCA 
TTTTTACGCGCCGGAGAAAACCGCCTCGCGGTGATGGTGCTGCGTTGGAGTGACGGCAGT
55 TATCTGGAAGATCAGGATATGTGGCGGATGAGCGGCATTTTCCGTGACGTCTCGTTGCTG
CATAAACCGACTACACAAATCAGCGATTTCCATGTTGCCACTCGCTTTAATGATGATTTC
AGCCGCGCTGTACTGGAGGCTGAAGTTCAGATGTGCGGCGAGTTGCGTGACTACCTACGG
GTAACAGTTTCTTTATGGCAGGGTGAAACGCAGGTCGCCAGCGGCACCGCGCCTTTCGGC 
GGTGAAATTATCGATGAGCGTGGTGGTTATGCCGATCGCGTCACACTACGTCTGAACGTC 
60 GAAAACCCGAAACTGTGGAGCGCCGAAATCCCGAATCTCTATCGTGCGGTGGTTGAACTG 
CACACCGCCGACGGCACGCTGATTGAAGCAGAAGCCTGCGATGTCGGTTTCCGCGAGGTG
CGGATTGAAAATGGTCTGCTGCTGCTGAACGGCAAGCCGTTGCTGATTCGAGGCGTTAAC 
CGTCACGAGCATCATCCTCTGCATGGTCAGGTCATGGATGAGCAGACGATGGTGCAGGAT 
ATCCTGCTGATGAAGCAGAACAACTTTAACGCCGTGCGCTGTTCGCATTATCCGAACCAT 
65 CCGCTGTGGTACACGCTGTGCGACCGCTACGGCCTGTATGTGGTGGATGAAGCCAATATT 
GAAACCCACGGCATGGTGCCAATGAATCGTCTGACCGATGATCCGCGCTGGCTACCGGCG 
ATGAGCGAACGCGTAACGCGAATGGTGCAGCGCGATCGTAATCACCCGAGTGTGATCATC 
TGGTCGCTGGGGAATGAATCAGGCCACGGCGCTAATCACGACGCGCTGTATCGCTGGATC 
AAATCTGTCGATCCTTCCCGCCCGGTGCAGTATGAAGGCGGCGGAGCCGACACCACGGCC 
70 ACCGATATTATTTGCCCGATGTACGCGCGCGTGGATGAAGACCAGCCCTTCCCGGCTGTG 
CCGAAATGGTCCATCAAAAAATGGCTTTCGCTACCTGGAGAGACGCGCCCGCTGATCCTT

tgcgaatacgcccacgcgatgggtaacagtcttggcggtttcgctaaatactggcaggcg 

tttcgtcagtatccccgtttacagggcggcttcgtctgggactgggtggatcagtcgctg

attaaatatgatgaaaacggcaacccgtggtcggcttacggcggtgattttggcgatacg 
CCGAACGATCGCCAGTTCTGTATGAACCGTCTGGTCTTTCCCGACCCCACGCCGCATCCA
GCGCTGACGGAAGCAAAACACCAGCAGCAGTTTTTCCAGTTCCGTTTATCCGGGCAAACC 
ATCGAAGTGACCAGCGAATACCTGTTCCGTCATAGCGATAACGAGCTCCTGCACTGGATG 
GTGGCGCTGGATGGTAAGCCGCTGGCAAGCGGTGAAGTGCCTCTGGATGTCGCTCCACAA 
5 GGTAAACAGTTGATTGAACTCCCTGAACTACCGCAGCCGGAGAGCGCCGGGCAACTCTGG 
CTCACAGTACGCGTAGTGCAACCGAACCCGACCGCATGGTCAGAAGCCGGGCACATCAGC 
GCCTGGCAGCAGTGGCGTCTGGCGGAAAACCTCAGTGTGACGCTCCCCGCCGCGTCCCAC 
GCCATCCCGCATCTGACCACCAGCGAAATGGATTTTTGCATCGAGCTGGGTAATAAGCGT 
TGGCAATTTAACCGCCAGTCAGGCTTTCTTTCACAGATGTGGATTGGCGATAAAAAACAA 

10 CTGCTGACGCCGCTGCGCGATCAGTTCACCCGTGCACCGCTGGATAACGACATTGGCGTA 
AGTGAAGCGACCCGCATTGACCCTAACGCCTGGGTCGAACGCTGGAAGGCGGCGGGCCAT 
TACCAGGCCGAAGCAGCGTTGTTGCAGTGCACGGCAGATACACTTGCTGATGCGGTGCTG 
ATTACGACCGCTCACGCGTGGCAGCATCAGGGGAAAACCTTATTTATCAGCCGGAAAACC
TACCGGATTGATGGTAGTGGTCAAATGGCGATTACCGTTGATGTTGAAGTGGCGAGCGAT 

15 ACACCGCATCCGGCGCGGATTGCCCTGAACTGCCAGCTGGCGCAGGTAGCAGAGCGGGTA 
AACTGGCTCGGATTAGGGCCGCAAGAAAACTATCCCGACCGCCTTACTGCCGCCTGTTTT
GACCGCTGGGATCTGCCATTGTCAGACATGTATACCCCGTACGTCTTCCCGAGCGAAAAC 
GGTCTGCGCTGCGGGACGCGCGAATTGAATTATGGCCCACACCAGTGGCGCGGCGACTTC 
CAGTTCAACATCAGCCGCTACAGTCAACAGCAACTGATGGAAACCAGCCATCGCCATCTG 

20 CTGCACGCGGAAGAAGGCACATGGCTGAATATCGACGGTTTCCATATGGGGATTGGTGGC 
GACGACTCCTGGAGCCCGTCAGTATCGGCGGAATTCCAGCTGAGCGCCGGTCGCTACCAT 
TACCAGTTGGTCTGGTGTCAAAAATAATAATAACCGGGCAGGGGGGATCCGCAGATCCGG 
CTGTGGAATGTGTGTCAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGT 
ATGCAAAGCATGCCTGCAGGAATTCGATATCAAGCTTATCGATACCGTCGACCTCGAGGG

25 GGGGCCCGGTACCCAGCTTTTGTTCCCTTTAGTGAGGGTTAATTGCGCGGGAAGTATTTA
TCACTAATCAAGCACAAGTAATACATGAGAAACTTTTACTACAGCAAGCACAATCCTCCA 
AAAAATTTTGTTTTTACAAAATCCCTGGTGAACATGATTGGAAGGGACCTACTAGGGTGC
TGTGGAAGGGTGATGGTGCAGTAGTAGTTAATGATGAAGGAAAGGGAATAATTGCTGTAC 
CATTAACCAGGACTAAGTTACTAATAAAACCAAATTGAGTATTGTTGCAGGAAGCAAGAC 

30 CCAACTACCATTGTCAGCTGTGTTTCCTGACCTCAATATTTGTTATAAGGTTTGATATGA

ATCCCAGGGGGAATCTCAACCCCTATTACCCAACAGTCAGAAAAATCTAAGTGTGAGGAG
AACACAATGTTTCAACCTTATTGTTATAATAATGACAGTAAGAACAGCATGGCAGAATCG 
AAGGAAGCAAGAGACCAAGAATGAACCTGAAAGAAGAATCTAAAGAAGAAAAAAGAAGAA 
ATGACTGGTGGAAAATAGGTATGTTTCTGTTATGCTTAGCAGGAACTACTGGAGGAATAC 

35 TTTGGTGGTATGAAGGACTCCCACAGCAACATTATATAGGGTTGGTGGCGATAGGGGGAA 
GATTAAACGGATCTGGCCAATCAAATGCTATAGAATGCTGGGGTTCCTTCCCGGGGTGTA 
GACCATTTCAAAATTACTTCAGTTATGAGACCAATAGAAGCATGCATATGGATAATAATA 
CTGCTACATTATTAGAAGCTTTAACCAATATAACTGCTCTATAAATAACAAAACAGAATT
AGAAACATGGAAGTTAGTAAAGACTTCTGGCATAACTCCTTTACCTATTTCTTCTGAAGC

40 TAACACTGGACTAATTAGACATAAGAGAGATTTTGGTATAAGTGCAATAGTGGCAGCTAT 
TGTAGCCGCTACTGCTATTGCTGCTAGCGCTACTATGTCnTATGTTGCTCTAACTGAGGT 
TAACAAAATAATGGAAGTACAAAATCATACTTTTGAGGTAGAAAATAGTACTCTAAATGG 
TATGGATTTAATAGAACGACAAATAAAGATATTATATGCTATGATTCTTCAAACACATGC
AGATGTTCAACTGTTAAAGGAAAGACAACAGGTAGAGGAGACATTTAATTTAATTGGATG 

45 TATAGAAAGAACACATGTATTTTGTCATACTGGTCATCCCTGGAATATGTCATGGGGACA 
TTTAAATGAGTCAACACAATGGGATGACTGGGTAAGCAAAATGGAAGATTTAAATCAAGA
GATACTAACTACACTTCATGGAGCCAGGAACAATTTGGCACAATCCATGATAACATTCAA
TACACCAGATAGTATAGCTCAATTTGGAAAAGACCTTTGGAGTCATATTGGAAATTGGAT
TCCTGGATTGGGAGCTTCCATTATAAAATATATAGTGATGTTTTTGCTTATTTATTTGTT

50 ACTAACCTCTTCGCCTAAGATCCTCAGGGCCCTCTGGAAGGTGACCAGTGGTGCAGGGTC 
CTCCGGCAGTCGTTACCTGAAGAAAAAATTCCATCACAAACATGCATCGCGAGAAGACAC 
CTGGGACCAGGCCCAACACAACATACACCTAGCAGGCGTGACCGGTGGATCAGGGGACAA 
ATACTACAAGCAGAAGTACTCCAGGAACGACTGGAATGGAGAATCAGAGGAGTACAACAG
GCGGCCAAAGAGCTGGGTGAAGTCAATCGAGGCATTTGGAGAGAGCTATATTTCCGAGAA 

55 GACCAAAGGGGAGATTTCTCAGCCTGGGGCGGCTATCAACGAGCACAAGAACGGCTCTGG 
GGGGAACAATCCTCACCAAGGGTCCTTAGACCTGGAGATTCGAAGCGAAGGAGGAAACAT
TTATGACTGTTGCATTAAAGCCCAAGAAGGAACTCTCGCTATCCCTTGCTGTGGATTTCC 
CTTATGGCTATTTTGGGGACTAGTAATTATAGTAGGACGCATAGCAGGCTATGGATTACG 
TGGACTCGCTGTTATAATAAGGATTTGTATTAGAGGCTTAAATTTGATATTTGAAATAAT

60 CAGAAAAATGCTTGATTATATTGGAAGAGCTTTAAATCCTGGCACATCTCATGTATCAAT 
GCCTCAGTATGTTTAGAAAAACAAGGGGGGAACTGTGGGGTTTTTATGAGGGGTTTTATA
AACTGCAGGAGTGGGGAGGCACGATGGCCGCTTTGGTCGAGGCGGATCCGGCCATTAGCC 
ATATTATTCATTGGTTATATAGCATAAATCAATATTGGCTATTGGCCATTGCATACGTTG 
TATCCATATCATAATATGTACATTTATATTGGCTCATGTCCAACATTACCGCCATGTTGA 

65 CATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCA
TATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAAC 
GACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACT 
TTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAA 
GTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGG 

70 CATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTA
GTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGG 
TTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGG 
CACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATG
GGCGGTAGGCATGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGGGCA 
CTCAGATTCTGCGGTCTGAGTCCCTTCTCTGCTGCGCTGAAAAGGCCTTTGTAATAAATA
TAATTCTCTACTCAGTCCCTGTCTCTAGTTTGTCTGTTCGAGATCCTACAGAGCTCATGC 
CTTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCC
ACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTA 
5 ACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCA 
GCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTC
CGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGC 
TCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACAT 
GTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTT 

10 CCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCG 
AAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTC 
TCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGT
GGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAA 
GCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTA 

15 TCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCACCAGCCACTGGTAA
CAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAA 
CTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTT
CGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTT 
TTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGAT

20 CTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCAT 
GAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATC 
AATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGC 
ACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTA 
GATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGA 

25 CCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCG 
CAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGC
TAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCAT 
CGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAG 
GCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGAT

30 CGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAA 
TTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAA 
GTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGA 
TAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGG 
GCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGC 

35 ACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGG

aaggcaaaatgccgcaaaaaagggaataagggcgacacggaaatgttgaatactcatact 
cttcctttttcaatattattgaagcatttatcagggttattgtctcatgagcggatacat
atttgaatgtatttagaaaaataaacaaataggggttccgcgcacatttccccgaaaagt
gccacctaaattgtaagcgttaatattttgttaaaattcgcgttaaatttttgttaaatc

40 agctcattttttaaccaataggccgaaatcggcaaaatcccttataaatcaaaagaatag 
accgagatagggttgagtgttgttccagtttggaacaagagtccactattaaagaacgtg 
gactccaacgtcaaagggcgaaaaaccgtctatcagggcgatggcccactacgtgaacca 
tcaccctaatcaagttttttggggtcgaggtgccgtaaagcactaaatcggaaccctaaa 
gggagcccccgatttagagcttgacggggaaagccaacctggcttatcgaaattaatacg 

45 actcactatagg 
5

pESYNGP (SEQ ID No 53)

TCAATATTGGCCATTAGCCATATTATTCATTGGTTATATAGCATAAATCAATATTGGCTA 
TTGGCCATTGCATACGTTGTATCTATATCATAATATGTACATTTATATTGGCTCATGTCC 

10 AATATGACCGCCATGTTGGCATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGG 
GTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCC 
GCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCAT 
AGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGC 
CCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTCCGCCCCCTATTGACGTCAATGA 

15 CGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTG 
GCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACAC 
CAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGT 
CAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTG
CGATCGCCCGCCCCGTTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATA 

20 AGCAGAGCTCGTTTAGTGAACCGTCAGATCACTAGAAGCTTTATTGCGGTAGTTTATCAC 
AGTTAAATTGCTAACGCAGTCAGTGCTTCTGACACAACAGTCTCGAACTTAAGCTGCAGT 
GACTCTCTTAAGGTAGCCTTGCAGAAGTTGGTCGTGAGGCACTGGGCAGGTAAGTATCAA 
GGTTACAAGACAGGTTTAAGGAGACCAATAGAAACTGGGCTTGTCGAGACAGAGAAGACT 
CTTGCGTTTCTGATAGGCACCTATTGGTCTTACTGACATCCACTTTGCCTTTCTCTCCAC 

25 AGGTGTCCACTCCCAGTTCAATTACAGCTCTTAAGGCTAGAGTACTTAATACGACTCACT 
ATAGGCTAGAGAATTCGCCACCATGGGCGATCCCCTCACCTGGTCCAAAGCCCTGAAGAA 
ACTGGAAAAAGTCACCGTTCAGGGTAGCCAAAAGCTTACCACAGGCAATTGCAACTGGGC 
ATTGTCCCTGGTGGATCTTTTCCACGACACTAATTTCGTTAAGGAGAAAGATTGGCAACT
CAGAGACGTGATCCCCCTCTTGGAGGACGTGACCCAAACATTGTCTGGGCAGGAGCGCGA 

30 AGCTTTCGAGCGCACCTGGTGGGCCATCAGCGCAGTCAAAATGGGGCTGCAAATCAACAA 
CGTGGTTGACGGTAAAGCTAGCTTTCAACTGCTCCGCGCTAAGTACGAGAAGAAAACCGC 
CAACAAGAAACAATCCGAACCTAGCGAGGAGTACCCAATTATGATCGACGGCGCCGGCAA 
TAGGAACTTCCGCCCACTGACTCCCAGGGGCTATACCACCTGGGTCAACACCATCCAGAC 
AAACGGACTTTTGAACGAAGCCTCCCAGAACCTGTTCGGCATCCTGTCTGTGGACTGCAC 

35 CTCCGAAGAAATGAATGCTTTTCTCGACGTGGTGCCAGGACAGGCTGGACAGAAACAGAT 
CCTGCTCGATGCCATTGACAAGATCGCCGACGACTGGGATAATCGCCACCCCCTGCCAAA 
CGCCCCTCTGGTGGCTCCCCCACAGGGGCCTATCCCTATGACCGCTAGGTTCATTAGGGG 
ACTGGGGGTGCCCCGCGAACGCCAGATGGAGCCAGCATTTGACCAATTTAGGCAGACCTA 
CAGACAGTGGATCATCGAAGCCATGAGCGAGGGGATTAAAGTCATGATCGGAAAGCCCAA 

40 GGCACAGAACATCAGGCAGGGGGCCAAGGAACCATACCCTGAGTTTGTCGACAGGCTTCT 
GTCCCAGATTAAATCCGAAGGCCACCCTCAGGAGATCTCCAAGTTCTTGACAGACACACT 
GACTATCCAAAATGCAAATGAAGAGTGCAGAAACGCCATGAGGCACCTCAGACCTGAAGA 
TACCCTGGAGGAGAAAATGTACGCATGTCGCGACATTGGCACTACCAAGCAAAAGATGAT 
GCTGCTCGCCAAGGCTCTGCAAACCGGCCTGGCTGGTCCATTCAAAGGAGGAGCACTGAA 

45 GGGAGGTCCATTGAAAGCTGCACAAACATGTTATAATTGTGGGAAGCCAGGACATTTATC 
TAGTCAATGTAGAGCACCTAAAGTCTGTTTTAAATGTAAACAGCCTGGACATTTCTCAAA 
GCAATGCAGAAGTGTTCCAAAAAACGGGAAGCAAGGGGCTCAAGGGAGGCCCCAGAAACA 
AACTTTCCCGATACAACAGAAGAGTCAGCACAACAAATCTGTTGTACAAGAGACTCCTCA 
GACTCAAAATCTGTACCCAGATCTGAGCGAAATAAAAAAGGAATACAATGTCAAGGAGAA 

50 GGATCAAGTAGAGGATCTCAACCTGGACAGTTTGTGGGAGTAACATACAATCTCGAGAAG 
AGGCCCACTACCATCGTCCTGATCAATGACACCCCTCTTAATGTGCTGCTGGACACCGGA 
GCCGACACCAGCGTTCTCACTACTGCTCACTATAACAGACTGAAATACAGAGGAAGGAAA 
TACCAGGGCACAGGCATCATCGGCGTTGGAGGCAACGTCGAAACCTTTTCCACTCCTGTC 
ACCATCAAAAAGAAGGGGAGACACATTAAAACCAGAATGCTGGTCGCCGACATCCCCGTC 

55 ACCATCCTTGGCAGAGACATTCTCCAGGACCTGGGCGCTAAACTCGTGCTGGCACAACTG 
TCTAAGGAAATCAAGTTCCGCAAGATCGAGCTGAAAGAGGGCACAATGGGTCCAAAAATC 
CCCCAGTGGCCCCTGACCAAAGAGAAGCTTGAGGGCGCTAAGGAAATCGTGCAGCGCCTG 
CTTTCTGAGGGCAAGATTAGCGAGGCCAGCGACAATAACCCTTACAACAGCCCCATCTTT
GTGATTAAGAAAAGGAGCGGCAAATGGAGACTCCTGCAGGACCTGAGGGAACTCAACAAG
ACCGTCCAGGTCGGAACTGAGATCTCTCGCGGACTGCCTCACCCCGGCGGCCTGATTAAA 
TGCAAGCACATGACAGTCCTTGACATTGGAGACGCTTATTTTACCATCCCCCTCGATCCT 
GAATTTCGCCCCTATACTGCTTTTACCATCCCCAGCATCAATCACCAGGAGCCCGATAAA 
5 CGCTATGTGTGGAAGTGCCTCCCCCAGGGATTTGTGCTTAGCCCCTACATTTACCAGAAG 
ACACTTCAAGAGATCCTCCAACCTTTCCGCGAAAGATACCCAGAGGTTCAACTCTACCAA 
TATATGGACGACCTGTTCATGGGGTCCAACGGGTCTAAGAAGCAGCACAAGGAACTCATC 
ATCGAACTGAGGGCAATCCTCCTGGAGAAAGGCTTCGAGACACCCGACGACAAGCTGCAA 
GAAGTTCCTCCATATAGCTGGCTGGGCTACCAGCTTTGCCCTGAAAACTGGAAAGTCCAG 

10 AAGATGCAGTTGGATATGGTCAAGAACCCAACACTGAACGACGTCCAGAAGCTCATGGGC 
AATATTACCTGGATGAGCTCCGGAATCCCTGGGCTTACCGTTAAGCACATTGCCGCAACT 
ACAAAAGGATGCCTGGAGTTGAACCAGAAGGTCATTTGGACAGAGGAAGCTCAGAAGGAA 
CTGGAGGAGAATAATGAAAAGATTAAGAATGCTCAAGGGCTCCAATACTACAATCCCGAA
GAAGAAATGTTGTGCGAGGTCGAAATCACTAAGAACTACGAAGCCACCTATGTCATCAAA 

15 CAGTCCCAAGGCATCTTGTGGGCCGGAAAGAAAATCATGAAGGCCAACAAAGGCTGGTCC 
ACCGTTAAAAATCTGATGCTCCTGCTCCAGCACGTCGCCACCGAGTCTATCACCCGCGTC 
GGCAAGTGCCCCACCTTCAAAGTTCCCTTCACTAAGGAGCAGGTGATGTGGGAGATGCAA 
AAAGGCTGGTACTACTCTTGGCTTCCCGAGATCGTCTACACCCACCAAGTGGTGCACGAC 
GACTGGAGAATGAAGCTTGTCGAGGAGCCCACTAGCGGAATTACAATCTATACCGACGGC 

20 GGAAAGCAAAACGGAGAGGGAATCGCTGCATACGTCACATCTAACGGCCGCACCAAGCAA 
AAGAGGCTCGGCCCTGTCACTCACCAGGTGGCTGAGAGGATGGCTATCCAGATGGCCCTT 
GAGGACACTAGAGACAAGCAGGTGAACATTGTGACTCACAGCTACTACTGCTGGAAAAAC 
ATCACAGAGGGCCTTGGCCTGGAGGGACCCCAGTCTCCCTGGTGGCCTATCATCCAGAAT 
ATCCGCGAAAAGGAAATTGTCTATTTCGCCTGGGTGCCTGGACACAAAGGAATTTACGGC

25 AACCAACTCGCCGATGAAGCCGCCAAAATTAAAGAGGAAATCATGCTTGCCTACCAGGGC 
ACACAGATTAAGGAGAAGAGAGACGAGGACGCTGGCTTTGACCTGTGTGTGCCATACGAC 
ATCATGATTCCCGTTAGCGACACAAAGATCATTCCAACCGATGTCAAGATCCAGGTGCCA 
CCCAATTCATTTGGTTGGGTGACCGGAAAGTCCAGCATGGCTAAGCAGGGTCTTCTGATT 
AACGGGGGAATCATTGATGAAGGATACACCGGCGAAATCCAGGTGATCTGCACAAATATC 

30 GGCAAAAGCAATATTAAGCTTATCGAAGGGCAGAAGTTCGCTCAACTCATCATCCTCCAG 
CACCACAGCAATTCAAGACAACCTTGGGACGAAAACAAGATTAGCCAGAGAGGTGACAAG 
GGCTTCGGCAGCACAGGTGTGTTCTGGGTGGAGAACATCCAGGAAGCACAGGACGAGCAC 
GAGAATTGGCACACCTCCCCTAAGATTTTGGCCCGCAATTACAAGATCCCACTGACTGTG
GCTAAGCAGATCACACAGGAATGCCCCCACTGCACCAAACAAGGTTCTGGCCCCGCCGGC 

35 TGCGTGATGAGGTCCCCCAATCACTGGCAGGCAGATTGCACCCACCTCGACAACAAAATT 
ATCCTGACCTTCGTGGAGAGCAATTCCGGCTACATCCACGCAACACTCCTCTCCAAGGAA 
AATGCATTGTGCACCTCCCTCGCAATTCTGGAATGGGCCAGGCTGTTCTCTCCAAAATCC
CTGCACACCGACAACGGCACCAACTTTGTGGCTGAACCTGTGGTGAATCTGCTGAAGTTC 
CTGAAAATCGCCCACACCACTGGCATTCCCTATCACCCTGAAAGCCAGGGCATTGTCGAG

40 AGGGCCAACAGAACTCTGAAAGAAAAGATCCAATCTCACAGAGACAATACACAGACATTG 
GAGGCCGCACTTCAGCTCGCCCTTATCACCTGCAACAAAGGAAGAGAAAGCATGGGCGGC 
CAGACCCCCTGGGAGGTCTTCATCACTAACCAGGCCCAGGTCATCCATGAAAAGCTGCTC 
TTGCAGCAGGCCCAGTCCTCCAAAAAGTTCTGCTTTTATAAGATCCCCGGTGAGCACGAC 
TGGAAAGGTCCTACAAGAGTTTTGTGGAAAGGAGACGGCGCAGTTGTGGTGAACGATGAG 

45 GGCAAGGGGATCATCGCTGTGCCCCTGACACGCACCAAGCTTCTCATCAAGCCAAACTGA 
ACCCGGGGCGGCCGCTTCCCTTTAGTGAGGGTTAATGCTTCGAGCAGACATGATAAGATA 
CATTGATGAGTTTGGACAAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGA 
AATTTGTGATGCTATTGCTTTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAA 
CAACAATTGCATTCATTTTATGTTTCAGGTTCAGGGGGAGATGTGGGAGGTTTTTTAAAG

50 CAAGTAAAACCTCTACAAATGTGGTAAAATCCGATAAGGATCGATCCGGGCTGGCGTAAT 
AGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGG 
ACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCG 
CTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCA
CGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTA 

55 GAGCTTTACGGCACCTCGACCGCAAAAAACTTGATTTGGGTGATGGTTCACGTAGTGGGC 
CATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTG
GACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTAT 
AAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACA^ 
ACGCGAATTTTAACAAAATATTAACGTTTACAATTTCGCCTGATGCGGTATTTTCTCCTT 

60 ACGCATCTGTGCGGTATTTCACACCGCATACGCGGATCTGCGCAGCACCATGGCCTGAAA 
TAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAACCAGCTGTGGA 
ATGTGTGTCAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAA 
GCATGCATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCA 
GAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGC 

65 CCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACTAATTT 
TTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGAAGTAGTGAG 
gaggcttttttgcaggcctagccttttgcaaaaagcttgattcttctgacacaacagtct
cgaacttaaggctagagccaccatgattgaacaagatggattgcacgcaggttctccggc
cgcttgggtggagaggctattcggctatgactgggcacaacagacaatcggctgctctga 
tgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtcaagaccgacct
5 gtccggtgccctgaatgaactccaggacgaggcagcgcggctatcgtggctggccacgac
gggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaagggactggctgct 
attgggcgaagtgccggggcaggatctcctgtcatctcaccttgctcctgccgagaaagt
atccatcatggctgatgcaatgcggcggctgcatacgcttgatccggctacctgcccatt 
cgaccaccaagcgaaacatcgcatcgagcgagcacgtactcggatggaagccggtcttgt

10 cgatcaggatgatctggacgaagagcatcaggggctcgcgccagccgaactgttcgccag
gctcaaggcgcgcatgcccgacggcgaggatctcgtcgtgacccatgccgatgcctgctt
gccgaatatcatggtggaaaatggccgcttttctggattcatcgactgtggccggctggg
tgtggcggaccgctatcaggacatagcgttggctacccgtgatattgctgaagagcttgg 
cggcgaatgggctgaccgcttcctcgtgctttacggtatcgccgctcccgattcgcagcg 

15 catcgccttctatcgccttcttgacgagttcttctgagcgggactctggggttcgaaatg 
accgaccaagcgacgcccaacctgccatcacgatggccgcaataaaatatctttattttc 
attacatctgtgtgttggttttttgtgtgaatcgatagcgataaggatccgcgtatggtg
cactctcagtacaatctgctctgatgccgcatagttaagccagccccgacacccgccaac
acccgctgacgcgccctgacgggcttgtctgctcccggcatccgcttacagacaagctgt 

20 gaccgtctccgggagctgcatgtgtcagaggttttcaccgtcatcaccgaaacgcgcgag 
acgaaagggcctcgtgatacgcctatttttataggttaatgtcatgataataatggtttc
ttagacgtcaggtggcacttttcggggaaatgtgcgcggaacccctatttgtttattttt 
ctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaatgcttcaata
atattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttt

25 tgcggcattttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagatgc 
tgaagatcagttgggtgcacgagtgggttacatcgaactggatctcaacagcggtaagat 
ccttgagagttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgct 
atgtggcgcggtattatcccgtattgacgccgggcaagagcaactcggtcgccgcataca 
ctattctcagaatgacttggttgagtactcaccagtcacagaaaagcatcttacggatgg 

30 catgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggccaa 
cttacttctgacaacgatcggaggaccgaaggagctaaccgcttttttgcacaacatggg
ggatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagccataccaaacga 
cgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactgg 
cgaactacttactctagcttcccggcaacaattaatagactggatggaggcggataaagt 

35 TGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGG 
AGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTC 
CCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACA 
GATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTC 
ATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGAT 

40 CCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTC 
AGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTG 
CTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCT 

accaactctttttccgaaggtaactggcttcagcagagcgcagataccaaatactgtcct

TCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCT 
45 CGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGG 

GTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTC 
GTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGA 
GCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGG 
CAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTA
50 TAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGG 
GGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTG
CTGGCCTTTTGCTCACATGGCTCGACAGATCT 



55 



pESDSYNGP (SEQ ID No 54)



TCAATATTGGCCATTAGCCATATTATTCATTGGTTATATAGCATAAATCAATATTGGCTA 
TTGGCCATTGCATACGTTGTATCTATATCATAATATGTACATTTATATTGGCTCATGTCC 
AATATGACCGCCATGTTGGCATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGG 
GTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCC 

60 GCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCAT 
AGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGC 
CCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTCCGCCCCCTATTGACGTCAATGA 
CGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTG
GCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACAC 

65 CAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGT 
CAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTG
CGATCGCCCGCCCCGTTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATA
AGCAGAGCTCGTTTAGTGAACCGTCACATCACTAGAAGCTTTATTGCGGTAGTTTATCAC
AGTTAAATTGCTAACGCAGTCAGTGCTTCTGACACAACAGTCTCGAACTTAAGCTGCAGT
5 GACTCTCTTAAGGTAGCCTTGCAGAAGTTGGTCGTGAGGCACTGGGCAGGTAAGTATCAA 
GGTTACAAGACAGGTTTAAGGAGACCAATAGAAACTGGGCTTGTCGAGACAGAGAAGACT 
CTTGCGTTTCTGATAGGCACCTATTGGTCTTACTGACATCCACTTTGCCTTTCTCTCCAC 
AGGTGTCCACTCCCAGTTCAATTACAGCTCTTAAGGCTAGAGTACTTAATACGACTCACT
ATAGGCTAGAGAATTCCAGGTAAGATGGGCGATCCCCTCACCTGGTCCAAAGCCCTGAAG 

10 AAACTGGAAAAAGTCACCGTTCAGGGTAGCCAAAAGCTTACCACAGGCAATTGCAACTGG
GCATTGTCCCTGGTGGATCTTTTCCACGACACTAATTTCGTTAAGGAGAAAGATTGGCAA
CTCAGAGACGTGATCCCCCTCTTGGAGGACGTGACCCAAACATTGTCTGGGCAGGAGCGC 
GAAGCTTTCGAGCGCACCTGGTGGGCCATCAGCGCAGTCAAAATGGGGCTGCAAATCAAC
AACGTGGTTGACGGTAAAGCTAGCTTTCAACTGCTCCGCGCTAAGTACGAGAAGAAAACC

15 GCCAACAAGAAACAATCCGAACCTAGCGAGGAGTACCCAATTATGATCGACGGCGCCGGC 
AATAGGAACTTCCGCCCACTGACTCCCAGGGGCTATACCACCTGGGTCAACACCATCCAG
ACAAACGGACTTTTGAACGAAGCCTCCCAGAACCTGTTCGGCATCCTGTCTGTGGACTGC
ACCTCCGAAGAAATGAATGCTTTTCTCGACGTGGTGCCAGGACAGGCTGGACAGAAACAG 
ATCCTGCTCGATGCCATTGACAAGATCGCCGACGACTGGGATAATCGCCACCCCCTGCCA 

20 AACGCCCCTCTGGTGGCTCCCCCACAGGGGCCTATCCCTATGACCGCTAGGTTCATTAGG 
GGACTGGGGGTGCCCCGCGAACGCCAGATGGAGCCAGCATTTGACCAATTTAGGCAGACC 
TACAGACAGTGGATCATCGAAGCCATGAGCGAGGGGATTAAAGTCATGATCGGAAAGCCC 
AAGGCACAGAACATCAGGCAGGGGGCCAAGGAACCATACCCTGAGTTTGTCGACAGGCTT
CTGTCCCAGATTAAATCCGAAGGCCACCCTCAGGAGATCTCCAAGTTCTTGACAGACACA

25 CTGACTATCCAAAATGCAAATGAAGAGTGCAGAAACGCCATGAGGCACCTCAGACCTGAA 
GATACCCTGGAGGAGAAAATGTACGCATGTCGCGACATTGGCACTACCAAGCAAAAGATG 
ATGCTGCTCGCCAAGGCTCTGCAAACCGGCCTGGCTGGTCCATTCAAAGGAGGAGCACTG 
AAGGGAGGTCCATTGAAAGCTGCACAAACATGTTATAATTGTGGGAAGCCAGGACATTTA
TCTAGTCAATGTAGAGCACCTAAAGTCTGTTTTAAATGTAAACAGCCTGGACATTTCTCA 

30 AAGCAATGCAGAAGTGTTCCAAAAAACGGGAAGCAAGGGGCTCAAGGGAGGCCCCAGAAA
CAAACTTTCCCGATACAACAGAAGAGTCAGCACAACAAATCTGTTGTACAAGAGACTCCT 
CAGACTCAAAATCTGTACCCAGATCTGAGCGAAATAAAAAAGGAATACAATGTCAAGGAG
AAGGATCAAGTAGAGGATCTCAACCTGGACAGTTTGTGGGAGTAACATACAATCTCGAGA 
AGACGCCCACTACCATCGTCCTGATCAATGACACCCCTCTTAATGTGCTGCTGGACACCG 

35 GAGCCGACACCAGCGTTCTCACTACTGCTCACTATAACAGACTGAAATACAGAGGAAGGA 
AATACCAGGGCACAGGCATCATCGGCGTTGGAGGCAACGTCGAAACCTTTTCCACTCCTG 
TCACCATCAAAAAGAAGGGGAGACACATTAAAACCAGAATGCTGGTCGCCGACATCCCCG 
TCACCATCCTTGGCAGAGACATTCTCCAGGACCTGGGCGCTAAACTCGTGCTGGCACAAC 
TGTCTAAGGAAATCAAGTTCCGCAAGATCGAGCTGAAAGAGGGCACAATGGGTCCAAAAA 

40 TCCCCCAGTGGCCCCTGACCAAAGAGAAGCTTGAGGGCGCTAAGGAAATCGTGCAGCGCC 
TGCTTTCTGAGGGCAAGATTAGCGAGGCCAGCGACAATAACCCTTACAACAGCCCCATCT 
TTGTGATTAAGAAAAGGAGCGGCAAATGGAGACTCCTGCAGGACCTGAGGGAACTCAACA 
AGACCGTCCAGGTCGGAACTGAGATCTCTCGCGGACTGCCTCACCCCGGCGGCCTGATTA 
AATGCAAGCACATGACAGTCCTTGACATTGGAGACGCTTATTTTACCATCCCCCTCGATC 

45 CTGAATTTCGCCCCTATACTGCTTTTACCATCCCCAGCATCAATCACCAGGAGCCCGATA 
AACGCTATGTGTGGAAGTGCCTCCCCCAGGGATTTGTGCTTAGCCCCTACATTTACCAGA 
AGACACTTCAAGAGATCCTCCAACCTTTCCGCGAAAGATACCCAGAGGTTCAACTCTACC 
AATATATGGACGACCTGTTCATGGGGTCCAACGGGTCTAAGAAGCAGCACAAGGAACTCA 
TCATCGAACTGAGGGCAATCCTCCTGGAGAAAGGCTTCGAGACACCCGACGACAAGCTGC 

50 AAGAAGTTCCTCCATATAGCTGGCTGGGCTACCAGCTTTGCCCTGAAAACTGGAAAGTCC 
AGAAGATGCAGTTGGATATGGTCAAGAACCCAACACTGAACGACGTCCAGAAGCTCATGG 
GCAATATTACCTGGATGAGCTCCGGAATCCCTGGGCTTACCGTTAAGCACATTGCCGCAA 
CTACAAAAGGATGCCTGGAGTTGAACCAGAAGGTCATTTGGACAGAGGAAGCTCAGAAGG 
AACTGGAGGAGAATAATGAAAAGATTAAGAATGCTCAAGGGCTCCAATACTACAATCCCG

55 AAGAAGAAATGTTGTGCGAGGTCGAAATCACTAAGAACTACGAAGCCACCTATGTCATCA
AACAGTCCCAAGGCATCTTGTGGGCCGGAAAGAAAATCATGAAGGCCAACAAAGGCTGGT 
CCACCGTTAAAAATCTGATGCTCCTGCTCCAGCACGTCGCCACCGAGTCTATCACCCGCG 
TCGGCAAGTGCCCCACCTTCAAAGTTCCCTTCACTAAGGAGCAGGTGATGTGGGAGATGC
AAAAAGGCTGGTACTACTCTTGGCTTCCCGAGATCGTCTACACCCACCAAGTGGTGCACG

60 ACGACTGGAGAATGAAGCTTGTCGAGGAGCCCACTACCGGAATTACAATCTATACCGACG 
GCGGAAAGCAAAACGGAGAGGGAATCGCTGCATACGTCACATCTAACGGCCGCACCAAGC
AAAAGAGGCTCGGCCCTGTCACTCACCAGGTGGCTGAGAGGATGGCTATCCAGATGGCCC
TTGAGGACACTAGAGACAAGCAGGTGAACATTGTGACTGACAGCTACTACTGCTGGAAAA 
ACATCACAGAGGGCCTTGGCCTGGAGGGACCCCAGTCTCCCTGGTGGCCTATCATCCAGA 

65 ATATCCGCGAAAAGGAAATTGTCTATTTCGCCTGGGTGCCTGGACACAAAGGAATTTACG
GCAACCAACTCGCCGATGAAGCCGCCAAAATTAAAGAGGAAATCATGCTTGCCTACCAGG
GCACACAGATTAAGGAGAAGAGAGACGAGGACGCTGGCTTTGACCTGTGTGTGCCATACG
ACATCATGATTCCCGTTAGCGACACAAAGATCATTCCAACCGATGTCAAGATCCAGGTGC

cacccaattcatttggttgggtgaccggaaagtccagcatggctaagcagggtcttctga 
ttaacgggggaatcattgatgaaggatacaccggcgaaatccaggtgatctgcacaaata
5 tcggcaaaagcaatattaagcttatcgaagggcagaagttcgctcaactcatcatcctcc
agcaccacagcaattcaagacaaccttgggacgaaaacaagattagccagagaggtgaca
agggcttcggcagcacaggtgtgttctgggtggagaacatccaggaagcacaggacgagc 
acgagaattggcacacctcccctaagattttggcccgca-attac.aagatcccactgactg 
tggctaagcagatcacacaggaatgcccccactgcaccaaacaaggttctggccccgccg 

10 gctgcgtgatgaggtcccccaatcactggcaggcagattgcacccacctcgacaacaaaa 
ttatcctgaccttcgtggagagcaattccggctacatccacgcaacactcctctccaagg 
aaaatgcattgtgcacctccctcgcaattctggaatgggccaggctgttctctccaaaat 
ccctgcacaccgacaacggcaccaactttgtggctgaacctgtggtgaatctgctgaagt 
tcctgaaaatcgcccacaccactggcattccctatcaccctgaaagccagggcattgtcg 

15 agagggccaacagaactctgaaagaaaagatccaatctcacagagacaatacacagacat 
tggaggccgcacttcagctcgcccttatcacctgcaacaaaggaagagaaagcatgggcg 
gccagaccccctgggaggtcttcatcactaaccaggcccaggtcatccatgaaaagctgc 
tcttgcagcaggcccagtcctccaaaaagttctgcttttataagatccccggtgagcacg 
actggaaaggtcctacaagagttttgtggaaaggagacggcgcagttgtggtgaacgatg 

20 agggcaaggggatcatcgctgtgcccctgacacgcaccaagcttctcatcaagccaaact 
gaacccggggcggccgcttccctttagtgagggttaatgcttcgagcagacatgataaga 
tacattgatgagtttggacaaaccacaactagaatgcagtgaaaaaaatgctttatttgt 

GAAATTTGTGATGCTATTGCTTTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAAC 
AACAACAATTGCATTCATTTTATGTTTCAGGTTCAGGGGGAGATGTGGGAGGTTTTTTAA 

25 AGCAAGTAAAACCTCTACAAATGTGGTAAAATCCGATAAGGATCGATCCGGGCTGGCGTA 
ATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAAT 
GGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGAC 
CGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGC 
CACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATT 

30 TAGAGCTTTACGGCACCTCGACCGCAAAAAACTTGATTTGGGTGATGGTTCACGTAGTGG 
GCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAG 
TGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTT 
ATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAATATT 
TAACGCGAATTTTAACAAAATATTAACGTTTACAATTTCGCCTGATGCGGTATTTTCTCC 

35 TTACGCATCTGTGCGGTATTTCACACCGCATACGCGGATCTGCGCAGCACCATGGCCTGA 
AATAACCTCTGAAAGAGGAACTTGGTTAGGTACCTTCTGAGGCGGAAAGAACCAGCTGTG 
GAATGTGTGTCAGTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCA 
AAGCATGCATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGG 
CAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCC 

40 GCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACTAAT 

tttttttatttatgcagaggccgaggccgcctcggcctctgagctattccagaagtagtg 
aggaggcttttttggaggcctaggcttttgcaaaaagcttgattcttctgacacaacagt 
ctcgaacttaaggctagagccaccatgattgaacaagatggattgcacgcaggttctccg 
gccgcttgggtggagaggctattcggctatgactgggcacaacagacaatcggctgctct 

45 gatgccgccgtgttccggctgtcagcgcaggggcgcccggttctttttgtcaagaccgac 
ctgtccggtgccctgaatgaactgcaggacgaggcagcgcggctatcgtggctggccacg 
acgggcgttccttgcgcagctgtgctcgacgttgtcactgaagcgggaagggactggctg 
ctattgggcgaagtgccggggcaggatctcctgtcatctcaccttgctcctgccgagaaa 
gtatccatcatggctgatgcaatgcggcggctgcatacgcttgatccggctacctgccca 

50 ttcgaccaccaagcgaaacatcgcatcgagcgagcacgtactcggatggaagccggtctt 
gtcgatcaggatgatctggacgaagagcatcaggggctcgcgccagccgaactgttcgcc 
aggctcaaggcgcgcatgcccgacggcgaggatctcgtcgtgacccatggcgatgcctgc 
ttgccgaatatcatggtggaaaatggccgcttttctggattcatcgactgtggccggctg 
ggtgtggcggaccgctatcaggacatagcgttggctacccgtgatattgctgaagagctt 

55 ggcggcgaatgggctgaccgcttcctcgtgctttacggtatcgccgctcccgattcgcag 
cgcatcgccttctatcgccttcttgacgagttcttctgagcgggactctggggttcgaaa 
tgaccgaccaagcgacgcccaacctgccatcacgatggccgcaataaaatatctttattt 
tcattacatctgtgtgttggttttttgtgtgaatcgatagcgataaggatccgcgtatgg 
tgcactctcagtacaatctgctctgatgccgcatagttaagccagccccgacacccgcca 

60 acacccgctgacgcgccctgacgggcttgtctgctcccggcatccgcttacagacaagct 
gtgaccgtctccgggagctgcatgtgtcagaggttttcaccgtcatcaccgaaacgcgcg 
agacgaaagggcctcgtgatacgcctatttttataggttaatgtcatgataataatggtt 
tcttagacgtcaggtggcacttttcggggaaatgtgcgcggaacccctatttgtttattt 
ttctaaatacattcaaatatgtatccgctcatgagacaataaccctgataaatgcttcaa 

65 taatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttt 
tttgcggcattttgccttcctgtttttgctcacccagaaacgctggtgaaagtaaaagat 
GCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAG 
ATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTG 
CTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATA 
CACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGAT 
5 GGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCC 
AACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATG 
GGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAAC 
GACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACT 
GGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAA 

10 GTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCT 

GGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCC 
TCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGA 
CAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTAC 
TCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAG 

15 ATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCG 
TCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATC 
TGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAG 
CTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTC 
CTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATAC 

20 CTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACC 

GGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGT 
TCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGT 
GAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGC 
GGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTT 

25 TATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCA 

GGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTT 
TGCTGGCCTTTTGCTCACATGGCTCGACAGATCT 



MLV construct CZCG (SEQ ID No 55) 

30 GTTACCTTCTGCTCTGCAGAATGGCCAACCTTTAACGTCGGATGGCCGCGAGACGGCACC 
TTTAACCGAGACCTCATCACCCAGGTTAAGATCAAGGTCTTTTCACCTGGCCCGCATGGA 
CACCCAGACCAGGTCCCCTACATCGTGACCTGGGAAGCCTTGGCTTTTGACCCCCCTCCC 
TGGGTCAAGCCCTTTGTACACCCTAAGCCTCCGCCTCCTCTTCCTCCATCCGCCCCGTCT 
CTCCCCCTTGAACCTCCTCGTTCGACCCCGCCTCGATCCTCCCTTTATCCAGCCCTCACT 

35 CCTTCTCTAGGCGCCGGAATTCGTTAACTCGAGAGGCCTGCCACCATGGGGACTGCTCCA 
AAGAAGAAGCGTAAGGTAGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACC 
CAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCC 
CGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGCGCTTTGCCTGG 
TTTCCGGCACCAGAAGCGGTGCCGGAAAGCTGGCTGGAGTGCGATCTTCCTGAGGCCGAT 

40 ACTGTCGTCGTCCCCTCAAACTGGCAGATGCACGGTTACGATGCGCCCATCTACACCAAC 
GTAACCTATCCCATTACGGTCAATCCGCCGTTTGTTCCCACGGAGAATCCGACGGGTTGT 
TACTCGCTCACATTTAATGTTGATGAAAGCTGGCTACAGGAAGGCCAGACGCGAATTATT 
TTTGATGGCGTTAACTCGGCGTTTCATCTGTGGTGCAACGGGCGCTGGGTCGGTTACGGC 
CAGGACAGTCGTTTGCCGTCTGAATTTGACCTGAGCGCATTTTTACGCGCCGGAGAAAAC 

45 CGCCTCGCGGTGATGGTGCTGCGTTGGAGTGACGGCAGTTATCTGGAAGATCAGGATATG 
TGGCGGATGAGCGGCATTTTCCGTGACGTCTCGTTGCTGCATAAACCGACTACACAAATC 
AGCGATTTCCATGTTGCCACTCGCTTTAATGATGATTTCAGCCGCGCTGTACTGGAGGCT 
GAAGTTCAGATGTGCGGCGAGTTGCGTGACTACCTACGGGTAACAGTTTCTTTATGGCAG 
GGTGAAACGCAGGTCGCCAGCGGCACCGCGCCTTTCGGCGGTGAAATTATCGATGAGCGT 

50 GGTGGTTATGCCGATCGCGTCACACTACGTCTGAACGTCGAAAACCCGAAACTGTGGAGC 
GCCGAAATCCCGAATCTCTATCGTGCGGTGGTTGAACTGCACACCGCCGACGGCACGCTG 
ATTGAAGCAGAAGCCTGCGATGTCGGTTTCCGCGAGGTGCGGATTGAAAATGGTCTGCTG 
CTGCTGAACGGCAAGCCGTTGCTGATTCGAGGCGTTAACCGTCACGAGCATCATCCTCTG 
CATGGTCAGGTCATGGATGAGCAGACGATGGTGCAGGATATCCTGCTGATGAAGCAGAAC 

55 AACTTTAACGCCGTGCGCTGTTCGCATTATCCGAACCATCCGCTGTGGTACACGCTGTGC 
GACCGCTACGGCCTGTATGTGGTGGATGAAGCCAATATTGAAACCCACGGCATGGTGCCA 
ATGAATCGTCTGACCGATGATCCGCGCTGGCTACCGGCGATGAGCGAACGCGTAACGCGA 
ATGGTGCAGCGCGATCGTAATCACCCGAGTGTGATCATCTGGTCGCTGGGGAATGAATCA 
GGCCACGGCGCTAATCACGACGCGCTGTATCGCTGGATCAAATCTGTCGATCCTTCCCGC 
CCGGTGCAGTATGAAGGCGGCGGAGCCGACACCACGGCCACCGATATTATTTGCCCGATG 
TACGCGCGCGTGGATGAAGACCAGCCCTTCCCGGCTGTGCCGAAATGGTCCATCAAAAAA 
TGGCTTTCGCTACCTGGAGAGACGCGCCCGCTGATCCTTTGCGAATACGCCCACGCGATG 
GGTAACAGTCTTGGCGGTTTCGCTAAATACTGGCAGGCGTTTCGTCAGTATCCCCGTTTA 
5 CAGGGCGGCTTCGTCTGGGACTGGGTGGATCAGTCGCTGATTAAATATGATGAAAACGGC 
AACCCGTGGTCGGCTTACGGCGGTGATTTTGGCGATACGCCGAACGATCGCCAGTTCTGT 
ATGAACGGTCTGGTCTTTGCCGACCGCACGCCGCATCCAGCGCTGACGGAAGCAAAACAC 
CAGCAGCAGTTTTTCCAGTTCCGTTTATCCGGGCAAACCATCGAAGTGACCAGCGAATAC 
CTGTTCCGTCATAGCGATAACGAGCTCCTGCACTGGATGGTGGCGCTGGATGGTAAGCCG 

10 CTGGCAAGCGGTGAAGTGCCTCTGGATGTCGCTCCACAAGGTAAACAGTTGATTGAACTG 
CCTGAACTACCGCAGCCGGAGAGCGCCGGGCAACTCTGGCTCACAGTACGCGTAGTGCAA 
CCGAACGCGACCGCATGGTCAGAAGCCGGGCACATCAGCGCCTGGCAGCAGTGGCGTCTG 
GCGGAAAACCTCAGTGTGACGCTCCCCGCCGCGTCCCACGCCATCCCGCATCTGACCACC 
AGCGAAATGGATTTTTGCATCGAGCTGGGTAATAAGCGTTGGCAATTTAACCGCCAGTCA 

15 GGCTTTCTTTCACAGATGTGGATTGGCGATAAAAAACAACTGCTGACGCCGCTGCGCGAT 
CAGTTCACCCGTGCACCGCTGGATAACGACATTGGCGTAAGTGAAGCGACCCGCATTGAC 
CCTAACGCCTGGGTCGAACGCTGGAAGGCGGCGGGCCATTACCAGGCCGAAGCAGCGTTG 
TTGCAGTGCACGGCAGATACACTTGCTGATGCGGTGCTGATTACGACCGCTCACGCGTGG 
CAGCATCAGGGGAAAACCTTATTTATCAGCCGGAAAACCTACCGGATTGATGGTAGTGGT 

20 CAAATGGCGATTACCGTTGATGTTGAAGTGGCGAGCGATACACCGCATCCGGCGCGGATT 
GGCCTGAACTGCCAGCTGGCGCAGGTAGCAGAGCGGGTAAACTGGCTCGGATTAGGGCCG 
CAAGAAAACTATCCCGACCGCCTTACTGCCGCCTGTTTTGACCGCTGGGATCTGCCATTG 
TCAGACATGTATACCCCGTACGTCTTCCCGAGCGAAAACGGTCTGCGCTGCGGGACGCGC 
GAATTGAATTATGGCCCACACCAGTGGCGCGGCGACTTCCAGTTCAACATCAGCCGCTAC 

25 AGTCAACAGCAACTGATGGAAACCAGCCATCGCCATCTGCTGCACGCGGAAGAAGGCACA 
TGGCTGAATATCGACGGTTTCCATATGGGGATTGGTGGCGACGACTCCTGGAGCCCGTCA 
GTATCGGCGGAATTCCAGCTGAGCGCCGGTCGCTACCATTACCAGTTGGTCTGGTGTCAA 
AAATAATAATAACCGGGCAGGGGGGATCCGCAGATCCGGCTGTGGAATGTGTGTCAGTTA 
GGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCCTGCAGGA 

30 GTGGGGAGGCACGATGGCCGCTTTGGTCGAGGCGGATCCGGCCATTAGCCATATTATTCA 
TTGGTTATATAGCATAAATCAATATTGGCTATTGGCCATTGCATACGTTGTATCCATATC 
ATAATATGTACATTTATATTGGCTCATGTCCAACATTACCGCCATGTTGACATTGATTAT 
TGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGT 
TCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCC 

35 CATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGAC 
GTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATA 
TGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCC 
AGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTA 
TTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCAC 

40 GGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATC 
AACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGC 
ATGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGA 
GACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGCG 
GCCCCAAGCTTGTTGGGATCCACCGGTCGCCACCATGGTGAGCAAGGGCGAGGAGCTGTT 

45 CACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAG 
CGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTG 
CACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGT 
GCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCAT 
GCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGAC 

50 CCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCAT 
CGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCA 
CAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCG 
CCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCAT 
CGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAG 

55 CAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGG 
GATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGCGGCCGCGACTCTAGATCATAATC 
AGCCATACCACATTTGTAGAGGTTTTACTTGCTTTAAAAAACCTCCCACACCTCCCCCTG 
AACCTGAAACATAAAATGAATGCAATTGTTGTTGTTAACATCGATAAAATAAAAGATTTT 
ATTTAGTCTCCAGAAAAAGGGGGGAATGAAAGACCCCACCTGTAGGTTTGGCAAGCTAGC 

60 TTAAGTAACGCCATTTTGCAAGGCATGGAAAAATACATAACTGAGAATAGAGAAGTTCAG 
ATCAAGGTCAGGAACAGATGGAACAGCTGAATATGGGCCAAACAGGATATCTGTGGTAAG 
CAGTTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGAACAGCTGAATATGGGCCAAACAG 
GATATCTGTGGTAAGCAGTTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGTCCCCAGAT 
GCGGTCCAGCCCTCAGCAGTTTCTAGAGAACCATCAGATGTTTCCAGGGTGCCCCAAGGA 
5 CCTGAAATGACCCTGTGCCTTATTTGAACTAACCAATCAGTTCGCTTCTCGCTTCTGTTC 
GCGCGCTTCTGCTCCCCGAGCTCAATAAAAGAGCCCACAACCCCTCACTCGGGGCGCCAG 
TCCTCCGATTGACTGAGTCGCCCGGGTACCCGTGTATCCAATAAACCCTCTTGCAGTTGC 
ATCCGACTTGTGGTCTCGCTGTTCCTTGGGAGGGTCTCCTCTGAGTGATTGACTACCCGT 
CAGCGGGGGTCTTTCATTTGGGGGCTCGTCCGGGATCGGGAGACCCCTGCCCAGGGACCA 

10 CCGACCCACCACCGGGAGGTAAGCTGGCCAGCAACTTATCTGTGTCTGTCCGATTGTCTA 
GTGTCTATGACTGATTTTATGCGCCTGCGTCGGTACTAGTTAGCTAACTAGCTCTGTATC 
TGGCGGACCCGTGGTGGAACTGACGAGTTCGGAACACCCGGCCGCAACCCTGGGAGAGGA 
ATTCTCATGTTTGACAGCTTATCATCGATAAGCTTTTTGCAAAAGCCTAGGCCTCCAAAA 
AAGCCTCCTCACTACTTCTGGAATAGCTCAGAGGCCGAGGCGGCCTCGGCCTCTGCATAA 

15 ATAAAAAAAATTAGTCAGCCATGGGGCGGAGAATGGGCGGAACTGGGCGGAGTTAGGGGC 
GGGATGGGCGGAGTTAGGGGCGGGACTATGGTTGCTGACTAATTGAGATGCATGCTTTGC 
ATACTTCTGCCTGCTGGGGAGCCTGGGGACTTTCCACACCTGGTTGCTGACTAATTGAGA 
TGCATGCTTTGCATACTTCTGCCTGCTGGGGAGCCTGGGGACTTTCCACACCCTAACTGA 
CACACATTCCACAGCCGGATCCTCTACGCCGGACGCATCGTGGCCGGCATCACCGGCGCC 

20 ACAGGTGCGGTTGCTGGCGCCTATATCGCCGACATCACCGATGGGGAAGATCGGGCTCGC 
CACTTCGGGCTCATGAGCGCTTGTTTCGGCGTGGGTATGGTGGCAGGCCCCGTGGCCGGG 
GGACTGTTGGGCGCCATCTCCTTGCATGCACCATTCCTTGCGGCGGCGGTGCTCAACGGC 
CTCAACCTACTACTGGGCTGCTTCCTAATGCAGGAGTCGCATAAGGGAGAGCGTCGACCG 
ATGCCCTTGAGAGCCTTCAACCCAGTCAGCTCCTTCCGGTGGGCGCGGGGCATGACTATC 

25 GTCGCCGCACTTATGACTGTCTTCTTTATCATGCAACTCGTAGGACAGGTGCCGGCAGCG 
CTCTGGGTCATTTTCGGCGAGGACCGCTTTCGCTGGAGCGCGACGATGATCGGCCTGTCG 
CTTGCGGTATTCGGAATCTTGCACGCCCTCGCTCAAGCCTTCGTCACTGGTCCCGCCACC 
AAACGTTTCGGCGAGAAGCAGGCCATTATCGCCGGCATGGCGGCCGACGCGCTGGGCTAC 
GTCTTGCTGGCGTTCGCGACGCGAGGCTGGATGGCCTTCCCCATTATGATTCTTCTCGCT 

30 TCCGGCGGCATCGGGATGCCCGCGTTGCAGGCCATGCTGTCCAGGCAGGTAGATGACGAC 
CATCAGGGACAGCTTCAAGGATCGCTCGCGGCTCTTACCAGCCTAACTTCGATCACTGGA 
CCGCTGATCGTCACGGCGATTTATGCCGCCTCGGCGAGCACATGGAACGGGTTGGCATGG 
ATTGTAGGCGCCGCCCTATACCTTGTCTGCCTCCCCGCGTTGCGTCGCGGTGCATGGAGC 
CGGGCCACCTCGACCTGAATGGAAGCCGGCGGCACCTCGCTAACGGATTCACCACTCCAA 

35 GAATTGGAGCCAATCAATTCTTGCGGAGAACTGTGAATGCGCAAACCAACCCTTGGCAGA 
ACATATCCATCGCGTCCGCCATCTCCAGCAGCCGCACGCGGCGCATCTCGGGCAGCGTTG 
GGTCCTGGCCACGGGTGCGCATGATCGTGCTCCTGTCGTTGAGGACCCGGCTAGGCTGGC 
GGGGTTGCCTTACTGGTTAGCAGAATGAATCACCGATACGCGAGCGAACGTGAAGCGACT 
GCTGCTGCAAAACGTCTGCGACCTGAGCAACAACATGAATGGTCTTCGGTTTCCGTGTTT 

40 CGTAAAGTCTGGAAACGCGGAAGTCAGCGCCCTGCACCATTATGTTCCGGATCTGCATCG 
CAGGATGCTGCTGGCTACCCTGTGGAACACCTACATCTGTATTAACGAAGCGCTGGCATT 
GACCCTGAGTGATTTTTCTCTGGTCCCGCCGCATCCATACCGCCAGTTGTTTACCCTCAC 
AACGTTCCAGTAACCGGGCATGTTCATCATCAGTAACCCGTATCGTGAGCATCCTCTCTC 
GTTTCATCGGTATCATTACCCCCATGAACAGAAATTCCCCCTTACACGGAGGCATCAAGT 

45 GACCAAACAGGAAAAAACCGCCCTTAACATGGCCCGCTTTATCAGAAGCCAGACATTAAC 
GCTTCTGGAGAAACTCAACGAGCTGGACGCGGATGAACAGGCAGACATCTGTGAATCGCT 
TCACGACCACGCTGATGAGCTTTACCGCAGCTGCCTCGCGCGTTTCGGTGATGACGGTGA 
AAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGG 
GAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCCAT 

50 GACCCAGTCACGTAGCGATAGCGGAGTGTATACTGGCTTAACTATGCGGCATCAGAGCAG 
ATTGTACTGAGAGTGCACCATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAA 
TACCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGG 
CTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGG 
GATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAG 

55 GCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGA 
CGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCT 
GGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCC 
TTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCG 
GTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGC 

60 TGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCA 
CTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAG 
TTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCT 
CTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACC 
ACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGA 
5 TCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCA 
CGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAAT 
TAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTAC 
CAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTT 
GCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGT 
10 GCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAG 
CCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCT 
ATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTT 
GTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGC 
TCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTT 
15 AGCTCCTTCGGTCCTCCGATCGTCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTC 
ATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCT 
GTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGC 
TCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTC 
ATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCC 
20 AGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGC 
GTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACA 
CGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGT 
TATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTT 
CCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACA 
25 TTAACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTCTCGCGCGTTTCGGTGATGAC 
GGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGAT 
GCCGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGG 
CTTAACTATGCGGCATCAGAGCAGATTGTACTGAGAGTGCACCATATCGACGCTCTCCCT 
TATGCGACTCCTGCATTAGGAAGCAGCCCAGTAGTAGGTTGAGGCCGTTGAGCACCGCCG 
30 CCGCAAGGAATGGTGCATGCAAGGAGATGGCGCCCAACAGTCCCCCGGCCACGGGGCCTG 
CCACCATACCCACGCCGAAACAAGCGCTCATGAGCCCGAAGTGGCGAGCCCGATCTTCCC 
CATCGGTGATGTCGGCGATATAGGCGCCAGCAACCGCACCTGTGGCGCCGGTGATGCCGG 
CCACGATGCGTCCGGCGTAGAGGATCTGGCTAGCGATGACCCTGCTGATTGGTTCGCTGA 
CCATTTCCGGGGTGCGGAACGGCGTTACCAGAAACTCAGAAGGTTCGTCCAACCAAACCG 
35 ACTCTGACGGCAGTTTACGAGAGAGATGATAGGGTCTGCTTCAGTAAGCCAGATGCTACA 
CAATTAGGCTTGTACATATTGTCGTTAGAACGCGGCTACAATTAATACATAACCTTATGT 
ATCATACACATACGATTTAGGTGACACTATAGAATACAAGCTGGAAGATCTTCCAGCTTG 
GGCTGCAGGTCGACTCTAGAGTCCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGA 
CCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCA 
40 ATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCA 
GTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGG 
CCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATC 
TACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGT 
GGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGT 
45 TTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTG 
ACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTG 
AACCGCGCCAGTCTTCCGATAGACTGCGTCGCCCGGGTACCCGTATTCCCAATAAAGCCT 
CTTGCTGTTTGCATCCGAATCGTGGTCTCGCTGTTCCTTGGGAGGGTCTCCTCTGAGTGA 
TTGACTACCCACGACGGGGGTCTTTCATTTGGGGGCTCGTCCGGGATTTGGAGACCCCTG 
50 CCCAGGGACCACCGACCCACCACCGGGAGGTAAGCTGGCCAGCAACTTATCTGTGTCTGT 
CCGATTGTCTAGTGTCTATGTTTGATGTTATGCGCCTGCGTCTGTACTAGTTAGCTAACT 
AGCTCTGTATCTGGCGGACCCGTGGTGGAACTGACGAGTTCTGAACACCCGGCCGCAACC 
CTGGGAGACGTCCCAGGGACTTTGGGGGCCGTTTTTGTGGCCCGACCTGAGGAAGGGAGT 
CGATGTGGAATCCGACCCCGTCAGGATATGTGGTTCTGGTAGGAGACGAGAACCTAAAAC 
55 AGTTCCCGCCTCCGTCTGAATTTTTGCTTTCGGTTTGGAACCGAAGCCGCGCGTCTTGTC 
TGCTGCAGCGCTGCAGCATCGTTCTGTGTTGTCTCTGTCTGACTGTGTTTCTGTATTTGT 
CTGAAAATTAGGGCCAGACTGTTACCACTCCCTTAAGTTTGACCTTAGGTCACTGGAAAG 
ATGTCGAGCGGATCGCTCACAACCAGTCGGTAGATGTCAAGAAGAGACGTTGG 

60 PTRAP2 (SEQ ID No 56) 
GTTACCTTCTGCTCTGCAGAATGGCCAACCTTTAACGTCGGATGGCCGCGAGACGGCACC 
TTTAACCGAGACCTCATCACCCAGGTTAAGATCAAGGTCTTTTCACCTGGCCCGCATGGA 
CACCCAGACCAGGTCCCCTACATCGTGACCTGGGAAGCCTTGGCTTTTGACCCCCCTCCC 
5 TGGGTCAAGCCCTTTGTACACCCTAAGCCTCCGCCTCCTCTTCCTCCATCCGCCCCGTCT 
CTCCCCCTTGAACCTCCTCGTTCGACCCCGCCTCGATCCTCCCTTTATCCAGCCCTCACT 
CCTTCTCTAGGCGCCGGAATTCGTTAACTCGAGAGGCCTGCCACCATGGGGACTGCTCCA 
AAGAAGAAGCGTAAGGTAGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACC 
CAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCC 

10 CGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGCGCTTTGCCTGG 
TTTCCGGCACCAGAAGCGGTGCCGGAAAGCTGGCTGGAGTGCGATCTTCCTGAGGCCGAT 
ACTGTCGTCGTCCCCTCAAACTGGCAGATGCACGGTTACGATGCGCCCATCTACACCAAC 
GTAACCTATCCCATTACGGTCAATCCGCCGTTTGTTCCCACGGAGAATCCGACGGGTTGT 
TACTCGCTCACATTTAATGTTGATGAAAGCTGGCTACAGGAAGGCCAGACGCGAATTATT 

15 TTTGATGGCGTTAACTCGGCGTTTCATCTGTGGTGCAACGGGCGCTGGGTCGGTTACGGC 
CAGGACAGTCGTTTGCCGTCTGAATTTGACCTGAGCGCATTTTTACGCGCCGGAGAAAAC 
CGCCTCGCGGTGATGGTGCTGCGTTGGAGTGACGGCAGTTATCTGGAAGATCAGGATATG 
TGGCGGATGAGCGGCATTTTCCGTGACGTCTCGTTGCTGCATAAACCGACTACACAAATC 
AGCGATTTCCATGTTGCCACTCGCTTTAATGATGATTTCAGCCGCGCTGTACTGGAGGCT 

20 GAAGTTCAGATGTGCGGCGAGTTGCGTGACTACCTACGGGTAACAGTTTCTTTATGGCAG 
GGTGAAACGCAGGTCGCCAGCGGCACCGCGCCTTTCGGCGGTGAAATTATCGATGAGCGT 
GGTGGTTATGCCGATCGCGTCACACTACGTCTGAACGTCGAAAACCCGAAACTGTGGAGC 
GCCGAAATCCCGAATCTCTATCGTGCGGTGGTTGAACTGCACACCGCCGACGGCACGCTG 
ATTGAAGCAGAAGCCTGCGATGTCGGTTTCCGCGAGGTGCGGATTGAAAATGGTCTGCTG 

25 CTGCTGAACGGCAAGCCGTTGCTGATTCGAGGCGTTAACCGTCACGAGCATCATCCTCTG 
CATGGTCAGGTCATGGATGAGCAGACGATGGTGCAGGATATCCTGCTGATGAAGCAGAAC 
AACTTTAACGCCGTGCGCTGTTCGCATTATCCGAACCATCCGCTGTGGTACACGCTGTGC 
GACCGCTACGGCCTGTATGTGGTGGATGAAGCCAATATTGAAACCCACGGCATGGTGCCA 
ATGAATCGTCTGACCGATGATCCGCGCTGGCTACCGGCGATGAGCGAACGCGTAACGCGA 

30 ATGGTGCAGCGCGATCGTAATCACCCGAGTGTGATCATCTGGTCGCTGGGGAATGAATCA 
GGCCACGGCGCTAATCACGACGCGCTGTATCGCTGGATCAAATCTGTCGATCCTTCCCGC 
CCGGTGCAGTATGAAGGCGGCGGAGCCGACACCACGGCCACCGATATTATTTGCCCGATG 
TACGCGCGCGTGGATGAAGACCAGCCCTTCCCGGCTGTGCCGAAATGGTCCATCAAAAAA 
TGGCTTTCGCTACCTGGAGAGACGCGCCCGCTGATCCTTTGCGAATACGCCCACGCGATG 

35 GGTAACAGTCTTGGCGGTTTCGCTAAATACTGGCAGGCGTTTCGTCAGTATCCCCGTTTA 
CAGGGCGGCTTCGTCTGGGACTGGGTGGATCAGTCGCTGATTAAATATGATGAAAACGGC 
AACCCGTGGTCGGCTTACGGCGGTGATTTTGGCGATACGCCGAACGATCGCCAGTTCTGT 
ATGAACGGTCTGGTCTTTGCCGACCGCACGCCGCATCCAGCGCTGACGGAAGCAAAACAC 
CAGCAGCAGTTTTTCCAGTTCCGTTTATCCGGGCAAACCATCGAAGTGACCAGCGAATAC 

40 CTGTTCCGTCATAGCGATAACGAGCTCCTGCACTGGATGGTGGCGCTGGATGGTAAGCCG 
CTGGCAAGCGGTGAAGTGCCTCTGGATGTCGCTCCACAAGGTAAACAGTTGATTGAACTG 
CCTGAACTACCGCAGCCGGAGAGCGCCGGGCAACTCTGGCTCACAGTACGCGTAGTGCAA 
CCGAACGCGACCGCATGGTCAGAAGCCGGGCACATCAGCGCCTGGCAGCAGTGGCGTCTG 
GCGGAAAACCTCAGTGTGACGCTCCCCGCCGCGTCCCACGCCATCCCGCATCTGACCACC 

45 AGCGAAATGGATTTTTGCATCGAGCTGGGTAATAAGCGTTGGCAATTTAACCGCCAGTCA 
GGCTTTCTTTCACAGATGTGGATTGGCGATAAAAAACAACTGCTGACGCCGCTGCGCGAT 
CAGTTCACCCGTGCACCGCTGGATAACGACATTGGCGTAAGTGAAGCGACCCGCATTGAC 
CCTAACGCCTGGGTCGAACGCTGGAAGGCGGCGGGCCATTACCAGGCCGAAGCAGCGTTG 
TTGCAGTGCACGGCAGATACACTTGCTGATGCGGTGCTGATTACGACCGCTCACGCGTGG 

50 CAGCATCAGGGGAAAACCTTATTTATCAGCCGGAAAACCTACCGGATTGATGGTAGTGGT 
CAAATGGCGATTACCGTTGATGTTGAAGTGGCGAGCGATACACCGCATCCGGCGCGGATT 
GGCCTGAACTGCCAGCTGGCGCAGGTAGCAGAGCGGGTAAACTGGCTCGGATTAGGGCCG 
CAAGAAAACTATCCCGACCGCCTTACTGCCGCCTGTTTTGACCGCTGGGATCTGCCATTG 
TCAGACATGTATACCCCGTACGTCTTCCCGAGCGAAAACGGTCTGCGCTGCGGGACGCGC 

55 GAATTGAATTATGGCCCACACCAGTGGCGCGGCGACTTCCAGTTCAACATCAGCCGCTAC 
AGTCAACAGCAACTGATGGAAACCAGCCATCGCCATCTGCTGCACGCGGAAGAAGGCACA 
TGGCTGAATATCGACGGTTTCCATATGGGGATTGGTGGCGACGACTCCTGGAGCCCGTCA 
GTATCGGCGGAATTCCAGCTGAGCGCCGGTCGCTACCATTACCAGTTGGTCTGGTGTCAA 
AAATAATAATAACCGGGCAGGGGGGATCCGCAGATCCGGCTGTGGAATGTGTGTCAGTTA 

60 GGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCCTGCAGGA 
GTGGGGAGGCACGATGGCCGCTTTGGTCGAGGCGGATCCGGCCATTAGCCATATTATTCA 
TTGGTTATATAGCATAAATCAATATTGGCTATTGGCCATTGCATACGTTGTATCCATATC 
ATAATATGTACATTTATATTGGCTCATGTCCAACATTACCGCCATGTTGACATTGATTAT 
TGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGT 
5 TCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCC 
CATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGAC 
GTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATA 
TGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCC 
AGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTA 

10 TTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCAC 
GGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATC 
AACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGC 
ATGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGA 
GACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGCG 

15 GCCCCAAGCTTGTTGGGATCCACCGGTCGCCACCATGGTGAGCAAGGGCGAGGAGCTGTT 
CACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAG 
CGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTG 
CACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGT 
GCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCAT 

20 GCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGAC 
CCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCAT 
CGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCA 
CAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCG 
CCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCAT 

25 CGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAG 
CAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGG 
GATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGCGGCCGCGACTCTAGATCATAATC 
AGCCATACCACATTTGTAGAGGTTTTACTTGCTTTAAAAAACCTCCCACACCTCCCCCTG 
AACCTGAAACATAAAATGAATGCAATTGTTGTTGTTAACATCGATAAAATAAAAGATTTT 

30 ATTTAGTCTCCAGAAAAAGGGGGGAATGAAAGACCCCACCTGTAGGTTTGGCAAGCTAGC 
ATAACTTCGTATAATGTATGCTATACGAAGTTATTCTAGAGAACCATCAGATGTTTCCAG 
GGTGCCCCAAGGACCTGAAATGACCCTGTGCCTTATTTGAACTAACCAATCAGTTCGCTT 
CTCGCTTCTGTTCGCGCGCTTCTGCTCCCCGAGCTCAATAAAAGAGCCCACAACCCCTCA 
CTCGGGGCGCCAGTCCTCCGATTGACTGAGTCGCCCGGGTACCCGTGTATCCAATAAACC 

35 CTCTTGCAGTTGCATCCGACTTGTGGTCTCGCTGTTCCTTGGGAGGGTCTCCTCTGAGTG 
ATTGACTACCCGTCAGCGGGGGTCTTTCATTTGGGGGCTCGTCCGGGATCGGGAGACCCC 
TGCCCAGGGACCACCGACCCACCACCGGGAGGTAAGCTGGCTGCCTCGCGCGTTTCGGTG 
ATGACGGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAG 
CGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGG 

40 GCGCAGCCATGACCCAGTCACGTAGCGATAGCGGAGTGTATACTGGCTTAACTATGCGGC 
ATCAGAGCAGATTGTACTGAGAGTGCACCATATGCGGTGTGAAATACCGCACAGATGCGT 
AAGGAGAAAATACCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTC 
GGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCAC 
AGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAA 

45 CCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCA 
CAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGC 
GTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATA 
CCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTA 
TCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCA 

50 GCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGA 
CTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGG 
TGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGG 
TATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGG 
CAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAG 

55 AAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAA 
CGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGAT 
CCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTC 
TGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTC 
ATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATC 

60 TGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGC 
AATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTC 
CATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTT 
GCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGC 
TTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAA 
5 AAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTT 
ATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATG 
CTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACC 
GAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAA 
AGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTT 

10 GAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTT 
CACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAG 
GGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTA 
TCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAAT 
AGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTAT 

15 CATGACATTAACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTCTTCAAGAATTCAT 
ACCAGATCACCGAAAACTGTCCTCCAAATGTGTCCCCCTCACACTCCCAAATTCGCGGGC 
TTCTGCCTCTTAGACCACTCTACCCTATTCCCCACACTCACCGGAGCCAAAGCCGCGGCC 
CTTCCGTTTCTTTGCTTTTGAAAGACCCCACCCGTAGGTGGCAAGCTAGCGATGACCCTG 
CTGATTGGTTCGCTGACCATTTCCGGGGTGCGGAACGGCGTTACCAGAAACTCAGAAGGT 

20 TCGTCCAACCAAACCGACTCTGACGGCAGTTTACGAGAGAGATGATAGGGTCTGCTTCAG 
TAAGCCAGATGCTACACAATTAGGCTTGTACATATTGTCGTTAGAACGCGGCTACAATTA 
ATACATAACCTTATGTATCATACACATACGATTTAGGTGACACTATAGAATACAAGCTGG 
AAGATCTTCCAGCTTGGGCTGCAGGTCGACTCTAGAGTCCGTTACATAACTTACGGTAAA 
TGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGT 

25 TCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTA 
AACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGT 
CAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGACTTTCC 
TACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTTGGCA 
GTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCAT 

30 TGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATGTCGTAA 
CAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAG 
CAGAGCTCGTTTAGTGAACCGCGCCAGTCTTCCGATAGACTGCGTCGCCCGGGTACCCGT 
ATTCCCAATAAAGCCTCTTGCTGTTTGCATCCGAATCGTGGTCTCGCTGTTCCTTGGGAG 
GGTCTCCTCTGAGTGATTGACTACCCACGACGGGGGTCTTTCATTTGGGGGCTCGTCCGG 

35 GATTTGGAGACCCCTGCCCAGGGACCACCGACCCACCACCGGGAGGTAAGCTGGCCAGCA 
ACTTATCTGTGTCTGTCCGATTGTCTAGTGTCTATGTTTGATGTTATGCGCCTGCGTCTG 
TACTAGTTAGCTAACTAGCTCTGTATCTGGCGGACCCGTGGTGGAACTGACGAGTTCTGA 
ACACCCGGCCGCAACCCTGGGAGACGTCCCAGGGACTTTGGGGGCCGTTTTTGTGGCCCG 
ACCTGAGGAAGGGAGTCGATGTGGAATCCGACCCCGTCAGGATATGTGGTTCTGGTAGGA 

40 GACGAGAACCTAAAACAGTTCCCGCCTCCGTCTGAATTTTTGCTTTCGGTTTGGAACCGA 
AGCCGCGCGTCTTGTCTGCTGCAGCGCTGCAGCATCGTTCTGTGTTGTCTCTGTCTGACT 
GTGTTTCTGTATTTGTCTGAAAATTAGGGCCAGACTGTTACCACTCCCTTAAGTTTGACC 
TTAGGTCACTGGAAAGATGTCGAGCGGATCGCTCACAACCAGTCGGTAGATGTCAAGAAG 
AGACGTTGG 

45 

PCGCLNGFR (SEQ ID No 57) 

GTTACCTTCTGCTCTGCAGAATGGCCAACCTTTAACGTCGGATGGCCGCGAGACGGCACC 
TTTAACCGAGACCTCATCACCCAGGTTAAGATCAAGGTCTTTTCACCTGGCCCGCATGGA 

50 CACCCAGACCAGGTCCCCTACATCGTGACCTGGGAAGCCTTGGCTTTTGACCCCCCTCCC 
TGGGTCAAGCCCTTTGTACACCCTAAGCCTCCGCCTCCTCTTCCTCCATCCGCCCCGTCT 
CTCCCCCTTGAACCTCCTCGTTCGACCCCGCCTCGATCCTCCCTTTATCCAGCCCTCACT 
CCTTCTCTAGGCGCCGGAATTCGTTAACTCGAGGATCCACCGGTCGCCACCATGGTGAGC 
AAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTA 

55 AACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTG 
ACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACC 
ACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC 
TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGAC 
GACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGC 

60 ATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAG 
TACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAG 
GTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTAC 
CAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGC 
ACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAG 
5 TTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGCGGCCCT 
AGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAGGAAGCA 
GTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGGCAGCGG 
AACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAAAGCCACCGAGTTGGTTCAGCTGC 
TGCCTGAGGCTGGACGACCTCGCGGAGTTCTACCGGCAGTGCAAATCCGTCGGCATCCAG 

10 GAAACCAGCAGCGGCTATCCGCGCATCCATGCCCCCGAACTGCAGGAGTGGGGAGGCACG 
ATGGCCGCTTTGGTCGAGGCGGATCCGGCCATTAGCCATATTATTCATTGGTTATATAGC 
ATAAATCAATATTGGCTATTGGCCATTGCATACGTTGTATCCATATCATAATATGTACAT 
TTATATTGGCTCATGTCCAACATTACCGCCATGTTGACATTGATTATTGACTAGTTATTA 
ATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATA 

15 ACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAAT 
AATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGA 
GTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCC 
CCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTT 
ATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGAT 

20 GCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAG 
TCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCC 
AAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCATGTACGGTGGGA 
GGTCTATATAAGCAGAGCTCGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACG 
CTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGCGGCCCCAAGCTTAC 

25 CATGGGGGCAGGTGCCACCGGCCGCGCCATGGACGGGCCGCGCCTGCTGCTGTTGCTGCT 
TCTGGGGGTGTCCCTTGGAGGTGCCAAGGAGGCATGCCCCACAGGCCTGTACACACACAG 
CGGTGAGTGCTGCAAAGCCTGCAACCTGGGCGAGGGTGTGGCCCAGCCTTGTGGAGCCAA 
CCAGACCGTGTGTGAGCCCTGCCTGGACAGCGTGACGTTCTCCGACGTGGTGAGCGCGAC 
CGAGCCGTGCAAGCCGTGCACCGAGTGCGTGGGGCTCCAGAGCATGTCGGCGCCGTGCGT 

30 GGAGGCCGACGACGCCGTGTGCCGCTGCGCCTACGGCTACTACCAGGATGAGACGACTGG 
GCGCTGCGAGGCGTGCCGCGTGTGCGAGGCGGGCTCGGGCCTCGTGTTCTCCTGCCAGGA 
CAAGCAGAACACCGTGTGCGAGGAGTGCCCCGACGGCACGTATTCCGACGAGGCCAACCA 
CGTGGACCCGTGCCTGCCCTGCACCGTGTGCGAGGACACCGAGCGCCAGCTCCGCGAGTG 
CACACGCTGGGCCGACGCCGAGTGCGAGGAGATCCCTGGCCGTTGGATTACACGGTCCAC 

35 ACCCCCAGAGGGCTCGGACAGCACAGCCCCCAGCACCCAGGAGCCTGAGGCACCTCCAGA 
ACAAGACCTCATAGCCAGCACGGTGGCAGGTGTGGTGACCACAGTGATGGGCAGCTCCCA 
GCCCGTGGTGACCCGAGGCACCACCGACAACCTCATCCCTGTCTATTGCTCCATCCTGGC 
TGCTGTGGTTGTGGGCCTTGTGGCCTACATAGCCTTCAAGAGGTGGAACAGCTGCTGAGT 
CGACTCTAGAGGATCCCCAACATCGATAAAATAAAAGATTTTATTTAGTCTCCAGAAAAA 

40 GGGGGGAATGAAAGACCCCACCTGTAGGTTTGGCAAGCTAGCTTAAGTAACGCCATTTTG 
CAAGGCATGGAAAAATACATAACTGAGAATAGAGAAGTTCAGATCAAGGTCAGGAACAGA 
TGGAACAGCTGAATATGGGCCAAACAGGATATCTGTGGTAAGCAGTTCCTGCCCCGGCTC 
AGGGCCAAGAACAGATGGAACAGCTGAATATGGGCCAAACAGGATATCTGTGGTAAGCAG 
TTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGTCCCCAGATGCGGTCCAGCCCTCAGCA 

45 GTTTCTAGAGAACCATCAGATGTTTCCAGGGTGCCCCAAGGACCTGAAATGACCCTGTGC 
CTTATTTGAACTAACCAATCAGTTCGCTTCTCGCTTCTGTTCGCGCGCTTCTGCTCCCCG 
AGCTCAATAAAAGAGCCCACAACCCCTCACTCGGGGCGCCAGTCCTCCGATTGACTGAGT 
CGCCCGGGTACCCGTGTATCCAATAAACCCTCTTGCAGTTGCATCCGACTTGTGGTCTCG 
CTGTTCCTTGGGAGGGTCTCCTCTGAGTGATTGACTACCCGTCAGCGGGGGTCTTTCATT 

50 TGGGGGCTCGTCCGGGATCGGGAGACCCCTGCCCAGGGACCACCGACCCACCACCGGGAG 
GTAAGCTGGCTGCCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCT 
CCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGGAGCAGACAAGCCCGTCAGGG 
CGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCCATGACCCAGTCACGTAGCGATAG 
CGGAGTGTATACTGGCTTAACTATGCGGCATCAGAGCAGATTGTACTGAGAGTGCACCAT 

55 ATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCC 
GCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCT 
CACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATG 
TGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTC 
CATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGA 

60 AACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCT 
CCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTG 
GCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAG 
CTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTAT 
CGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAAC 
5 AGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAAC 
TACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTC 
GGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTT 
TTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATC 
TTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATG 

10 AGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCA 
ATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCA 
CCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAG 
ATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGAC 
CCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGC 

15 AGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCT 
AGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATC 
GTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGG 
CGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATC 
GTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAAT 

20 TCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAG 
TCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGAT 
AATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGG 
CGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCA 
CCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGA 

25 AGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTC 
TTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATA 
TTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTG 
CCACCTGACGTCTAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTATC 
ACGAGGCCCTTTCGTCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAG 

30 CTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGGAGCAGACAAGCCCGTCAG 
GGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAACTATGCGGCATCAGAGCAG 
ATTGTACTGAGAGTGCACCATATGGACATATTGTCGTTAGAACGCGGCTACAATTAATAC 
ATAACCTTATGTATCATACACATACGATTTAGGTGACACTATAGAACTCGACTCTAGAGT 
CCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCAT 

35 TGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTC 
AATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGC 
CAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGT 
ACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTA 
CCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGG 

40 GATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAAC 
GGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTG 
TACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGCGCCAGTCTTCCGATAG 
ACTGCGTCGCCCGGGTACCCGTATTCCCAATAAAGCCTCTTGCTGTTTGCATCCGAATCG 
TGGTCTCGCTGTTCCTTGGGAGGGTCTCCTCTGAGTGATTGACTACCCACGACGGGGGTC 

45 TTTCATTTGGGGGCTCGTCCGGGATTTGGAGACCCCTGCCCAGGGACCACCGACCCACCA 
CCGGGAGGTAAGCTGGCCAGCAACTTATCTGTGTCTGTCCGATTGTCTAGTGTCTATGTT 
TGATGTTATGCGCCTGCGTCTGTACTAGTTAGCTAACTAGCTCTGTATCTGGCGGACCCG 
TGGTGGAACTGACGAGTTCTGAACACCCGGCCGCAACCCTGGGAGACGTCCCAGGGACTT 
TGGGGGCCGTTTTTGTGGCCCGACCTGAGGAAGGGAGTCGATGTGGAATCCGACCCCGTC 

50 AGGATATGTGGTTCTGGTAGGAGACGAGAACCTAAAACAGTTCCCGCCTCCGTCTGAATT 
TTTGCTTTCGGTTTGGAACCGAAGCCGCGCGTCTTGTCTGCTGCAGCGCTGCAGCATCGT 
TCTGTGTTGTCTCTGTCTGACTGTGTTTCTGTATTTGTCTGAAAATTAGGGCCAGACTGT 
TACCACTCCCTTAAGTTTGACCTTAGGTCACTGGAAAGATGTCGAGCGGATCGCTCACAA 
CCAGTCGGTAGATGTCAAGAAGAGACGTTGG 


pLTRloxP (SEQ ID No 58) 

GCTAGCATAACTTCGTATAATGTATGCTATACGAAGTTATTCTAGAGAACCATCAGATGT 
60 TTCCAGGGTGCCCCAAGGACCTGAAATGACCCTGTGCCTTATTTGAACTAACCAATCAGT 
TCGCTTCTCGCTTCTGTTCGCGCGCTTCTGCTCCCCGAGCTCAATAAAAGAGCCCACAAC 
CCCTCACTCGGGGCGCCAGTCCTCCGATTGACTGAGTCGCCCGGGTACCCGTGTATCCAA 
TAAACCCTCTTGCAGTTGCATCCGACTTGTGGTCTCGCTGTTCCTTGGGAGGGTCTCCTC 
TGAGTGATTGACTACCCGTCAGCGGGGGTCTTTCATTTGGGGGCTCGTCCGGGATCGGGA 
5 GACCCCTGCCCAGGGACCACCGACCCACCACCGGGAGGTAAGCTGGCTGCCTCGCGCGTT 
TCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTC 
TGTAAGCGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGT 
GTCGGGGCGCAGCCATGACCCAGTCACGTAGCGATAGCGGAGTGTATACTGGCTTAACTA 
TGCGGCATCAGAGCAGATTGTACTGAGAGTGCACCATATGCGGTGTGAAATACCGCACAG 

10 ATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCT 
GCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTT 
ATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGC 
CAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGA 
GCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATA 

15 CCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTAC 
CGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTG 
TAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCC 
CGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAG 
ACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGT 

20 AGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGT 
ATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTG 
ATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTAC 
GCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCA 
GTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCAC 

25 CTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAAC 
TTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATT 
TCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTT 
ACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTT 
ATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATC 

30 CGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAA 
TAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGG 
TATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTT 
GTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGC 
AGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGT 

35 AAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCG 
GCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAAC 
TTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACC 
GCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTT 
TACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGG 

40 AATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAG 
CATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAA 
ACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCAT 
TATTATCATGACATTAACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTCTTCAAGA 
ATTCATACCAGATCACCGAAAACTGTCCTCCAAATGTGTCCCCCTCACACTCCCAAATTC 

45 GCGGGCTTCTGCCTCTTAGACCACTCTACCCTATTCCCCACACTCACCGGAGCCAAAGCC 
GCGGCCCTTCCGTTTCTTTGCTTTTGAAAGACCCCACCCGTAGGTGGCAA 

LTR plasmid (SEQ ID No 59) 

GCTAGCTTAAGTAACGCCATTTTGCAAGGCATGGAAAAATACATAACTGAGAATAGAGAA 
50 GTTCAGATCAAGGTCAGGAACAGATGGAACAGCTGAATATGGGCCAAACAGGATATCTGT 
GGTAAGCAGTTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGAACAGCTGAATATGGGCC 
AAACAGGATATCTGTGGTAAGCAGTTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGTCC 
CCAGATGCGGTCCAGCCCTCAGCAGTTTCTAGAGAACCATCAGATGTTTCCAGGGTGCCC 
CAAGGACCTGAAATGACCCTGTGCCTTATTTGAACTAACCAATCAGTTCGCTTCTCGCTT 
55 CTGTTCGCGCGCTTCTGCTCCCCGAGCTCAATAAAAGAGCCCACAACCCCTCACTCGGGG 
CGCCAGTCCTCCGATTGACTGAGTCGCCCGGGTACCCGTGTATCCAATAAACCCTCTTGC 
AGTTGCATCCGACTTGTGGTCTCGCTGTTCCTTGGGAGGGTCTCCTCTGAGTGATTGACT 
ACCCGTCAGCGGGGGTCTTTCATTTGGGGGCTCGTCCGGGATCGGGAGACCCCTGCCCAG 
GGACCACCGACCCACCACCGGGAGGTAAGCTGGCTGCCTCGCGCGTTTCGGTGATGACGG 
TGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGC 
CGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGC 
CATGACCCAGTCACGTAGCGATAGCGGAGTGTATACTGGCTTAACTATGCGGCATCAGAG 
CAGATTGTACTGAGAGTGCACCATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGA 
5 AAATACCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTT 
CGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCA 
GGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAA 
AAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAAT 
CGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCC 

10 CCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCC 
GCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGT 
TCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGAC 
CGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCG 
CCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACA 

15 GAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGC 
GCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAA 
ACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAA 
GGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAAC 
TCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTA 

20 AATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGT 
TACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATA 
GTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCC 
AGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAAC 
CAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAG 

25 TCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAAC 
GTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTC 
AGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCG 
GTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTC 
ATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCT 

30 GTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGC 
TCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTC 
ATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCC 
AGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGC 
GTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACA 

35 CGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGT 
TATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTT 
CCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACA 
TTAACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTCTTCAAGAATTCATACCAGAT 
CACCGAAAACTGTCCTCCAAATGTGTCCCCCTCACACTCCCAAATTCGCGGGCTTCTGCC 

40 TCTTAGACCACTCTACCCTATTCCCCACACTCACCGGAGCCAAAGCCGCGGCCCTTCCGT 
TTCTTTGCTTTTGAAAGACCCCACCCGTAGGTGGCAA 
1. A method of modifying a producer cell which producer cell comprises integrated 
into its genome a provirus which provirus comprises one or more recombinase 
recognition sequences within or upstream of its 3' LTR, the method comprising: 

introducing into the cell a construct comprising a 5' recombinase recognition sequence, 
an LTR and a 3' recombinase recognition sequence in that order, in the presence of a 
recombinase which is capable of acting on the recombinase recognition site(s) such that 
the nucleotide sequence between the 5' and 3' recombinase recognition sequences in the 
construct is introduced into the provirus. 

2. A method according to claim 1 wherein the construct further comprises at least 
one nucleotide sequence of interest (NOI) between the 5' recombinase recognition 
sequence and the LTR, which NOI is operably linked to a transcriptional regulatory 
sequence. 

3. A method according to claim 1 or claim 2 wherein the construct further comprises 
a 5' LTR and/or a packaging signal. 

4. A method according to any one of claims 1 to 3 wherein the LTR is a 
heterologous regulatable LTR. 

5. A method according to claim 4 wherein the regulatable LTR comprises an 
ischaemic like response element (ILRE). 

6. A method according to any one of claims 1 to 3 wherein the LTR is inactive. 

7. A method according to any one of the preceding claims wherein the provirus 
comprises an NOI encoding a selectable marker, which NOI is flanked by recombinase 
recognition sites. 



8. A method according to any one of the preceding claims wherein the provirus 
comprises an internal 5' LTR upstream of the recombinase site or the 5' recombinase site 
where there is more than one site. 

9. A method according to any one of the preceding claims wherein the U3 region of 
the 5' LTR is inactive. 

10. A method according to any one of the preceding claims wherein the U3 region of 
the 5' LTR and/or the U3 region of the second internal 5' LTR comprises a heterologous 
promoter. 

11. A method according to any one of the preceding claims wherein the provirus 
comprises two recombinase recognition sites and as a preliminary step, the recombinase 
is expressed in a host cell such that the nucleotide sequence present between the two sites 
is excised. 

12. A method according to any one of the preceding claims wherein the producer cell 
is a high titre producer cell. 

13. A method according to any one of the preceding claims wherein the provirus is a 
lentivirus. 

14. A method according to claim 13 wherein the lentivirus is HIV or EIAV. 

15. A method according to any one of claims 2-14 wherein the provirus further 
comprises a second NOI. 

16. A producer cell obtainable by the method of any one of claims 1 to 15. 

17. An infectious retroviral particle obtainable from the producer cell of claim 16. 

18. A derived producer cell comprising integrated into its genome a retroviral vector 
comprising in the 5' to 3' direction a first 5' LTR; a second NOI operably linked to a 
second regulatable 3' LTR; and a third 3' LTR; 
wherein the third 3' LTR is positioned downstream of the second regulatable 
3' LTR in the producer cell. 

19. A producer cell according to claim 18 wherein the first 5' LTR comprising 5' R 
and 5' U5 sequences is derivable from a first vector; the second NOI operably linked to a 
second regulatable 3' LTR is derivable from a second vector; and the third 3' LTR is 
derivable from the first vector. 

20. A producer cell according to claim 18 or claim 19 wherein the first vector 
comprises a retroviral vector wherein the retroviral vector comprises a first NOI flanked 
by recombinase recognition sequences. 

21. A producer cell according to claim 19 or claim 20 wherein the retroviral vector 
further comprises an internal LTR located upstream of the first NOI and downstream of a 
packaging signal wherein the internal LTR comprises a heterologous U3 sequence linked 
to heterologous R and U5 sequences. 

22. A producer cell according to any one of claims 18 to 21 wherein the third 3' LTR 
is transcriptionally quiescent. 

23. A producer cell according to claim 22 wherein the third 3' LTR comprises a 
deletion in the U3 sequence. 

24. A producer cell according to any one of claims 18 to 23 wherein the first NOI is a 
selectable marker. 

25. A producer cell according to claim 19 wherein the second vector comprises a 
second NOI operably linked to a second regulatable 3' LTR comprising at least one 
recombinase recognition sequence. 

26. A producer cell according to claim 25 wherein the second regulatable 3' LTR comprises 
a deletion in the U3 sequences in the 3' LTR. 

27. A producer cell according to claim 25 or claim 26 wherein the second NOI 
comprises a coding sequence operably linked to a promoter. 

28. A producer cell according to claim 27 wherein the second NOI comprises a 
discistronic construct. 

29. A producer cell according to claim 28 wherein the discistronic construct 
comprises a therapeutic gene, an internal ribosomal entry site (IRES) and a reporter gene. 

30. A method for producing a high titre regulatable retroviral vector, the method 
comprising the steps of: 

(i) providing a derived producer cell comprising integrated into its genome a first vector; 

(ii) introducing a second vector into the derived producer cell using a recombinase 
assisted method; 

wherein the derived producer cell comprises a retroviral vector comprising in the 5' to 3' 
direction a first 5' LTR; a second NOI operably linked to a second regulatable 3' LTR; 
and a third 3' LTR; wherein the third 3' LTR is positioned downstream of the second 
regulatable 3' LTR in the derived producer cell. 

31. A method according to claim 30 wherein the third 3' LTR is transcriptionally 
active but expression is directed away from the second regulatable 3' LTR. 

32. A method for introducing a second regulatable 3' LTR into a derived producer cell 
wherein the method comprises a recombinase assisted method. 

33. A method according to claim 32 wherein the recombinase assisted method is a 
Cre/lox recombinase method. 
34. A process for preparing a regulated retroviral vector as defined in claim 17 
comprising performing the method according to any one of claims 30 to 33 and preparing 
a quantity of the regulated retroviral vector. 

35. A regulated retroviral vector produced by the process according to claim 34. 

36. A regulated retroviral vector according to claim 35 wherein the retroviral vector is 
capable of transducing a target site. 

37. A regulated retroviral vector according to claim 36 wherein the retroviral vector is 
produced in sufficient amounts to effectively transduce a target site. 

38. A regulated retroviral vector according to claim 36 or claim 37 wherein the target 
site is a cell. 

39. A cell transduced with a regulated retroviral vector according to claim 38. 

40. Use of a regulated retroviral vector according to any one of claims 35 to 38 in the 
manufacture of a pharmaceutical composition to deliver an NOI to a target site. 

41. Use of a regulated retroviral vector according to any one of claims 35 to 38 in the 
manufacture of a medicament for diagnostic and/or therapeutic and/or medical 
applications. 

42. Use of a recombinase assisted mechanism to introduce a regulated 3' LTR into a 
derived producer cell line to produce a high titre regulated retroviral vector. 

43. A derived stable producer cell capable of expressing regulated retroviral vectors 
according to claims 35 to 38. 

44. A derived stable producer cell according to claim 43 wherein the regulated 
retroviral vector is a high titre regulated retroviral vector. 
49. A nucleic acid molecule according to any one of claims 46 to 48 wherein the LTR 
is a heterologous regulatable LTR. 

50. A nucleic acid molecule according to any one of claims 46 to 48 wherein the LTR 
is transcriptionally quiescent. 

51. A method and/or a producer cell substantially as described herein and with 
reference to the accompanying Figures. 
FIG. 1 

MLV-based transduction using Cre/loxP system as previously described 

FIG. 2 

EIAV-based transduction Cre/loxP system 

FIG. 3 

MLV SIN vector approach, with EIAV components in blue 

FIG. 4 

MLV-based transduction with HRE 3' LTR using Cre/loxP system 

FIG. 5 

MLV-based transduction for SIN vector production using Cre/loxP system 

FIG. 6 

MLV SIN-vector based transduction system. This general approach can be 
used with EIAV, HIV or MLV genomes 

Virion RNA 

FIG. 7 

Overall summary of recombinase method 
FACS analysis of EV1 packaging cells prior to transduction with Trap2 vector 

FIG. 10b 

FACS analysis of EV1 packaging cell line transduced with Trap2 at an 
MOI of 0.3. A 5% top-slice of the highest expressers was carried out 
FIG. 11 

Validation of the ΔΔCt method for quantitation of GFP mRNA, relative to β-actin. 
A titration of total RNA from EV1 clone A was used. The difference in Ct values 
between the two assays is shown on the y-axis. The magnitude of the gradient must be 
<0.1 for the method to be valid. The gradient is 0.077, so the method is suitable. 

y = -0.077x + 2.207 

R² = 0.4137 
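
The validity check described in the FIG. 11 caption can be sketched as a straight-line fit followed by a test on the gradient. This is a minimal illustration only: it assumes the x-axis is the log of the input RNA amount (the caption does not say), and the function names `fit_line` and `ddct_method_is_valid` are hypothetical, not part of the original work.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of ys against xs; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def ddct_method_is_valid(log_input, delta_ct, max_gradient=0.1):
    """Apply the caption's criterion: |gradient| of delta-Ct vs input must be < 0.1.

    log_input : assumed log-scaled amounts of total RNA in the titration
    delta_ct  : Ct(target) - Ct(reference) measured at each amount
    Returns (is_valid, slope, intercept).
    """
    slope, intercept = fit_line(log_input, delta_ct)
    return abs(slope) < max_gradient, slope, intercept
```

With the fitted line reported in the caption (gradient -0.077), the magnitude is below 0.1, so the check passes; a steeper line, for example one with gradient 0.5, would fail.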
FIG. 12 

Quantitation of GFP mRNA relative to control β-actin 
mRNA. EV2 TD cells are transduced with Trap2 at an MOI of 0.3 
and are the calibrator sample with the ratio designated 1.0. 
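
Relative quantitation against a calibrator, as in FIG. 12, is commonly computed with the comparative Ct formula, ratio = 2^(-ΔΔCt). The sketch below assumes that standard formula and roughly 100% amplification efficiency for both assays; the source does not give the calculation, and the function name and arguments are hypothetical.

```python
def relative_expression(ct_target, ct_reference, ct_target_cal, ct_reference_cal):
    """Comparative Ct (2^-ddCt) ratio of a sample relative to a calibrator.

    ct_target, ct_reference         : Ct values of the sample (e.g. GFP and beta-actin)
    ct_target_cal, ct_reference_cal : the same two Ct values for the calibrator sample
    """
    delta_ct_sample = ct_target - ct_reference
    delta_ct_calibrator = ct_target_cal - ct_reference_cal
    delta_delta_ct = delta_ct_sample - delta_ct_calibrator
    return 2.0 ** (-delta_delta_ct)
```

By construction the calibrator compared with itself gives a ratio of 1.0, which matches the caption's convention for the calibrator sample.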
FIG. 16 

Alignment of leader and gag regions present in vectors pONY4Z, 8Z and 
ATG mutated 8Z vector. The latter is referred to as pONY8ZA. The sequences 
aligned are from the NarI site in the leader to the XbaI site between the EIAV 
gag sequence and the CMV promoter. Sequences in the leader are shown in 
italic and a space is present upstream of the position of the gag ATG. 

4Z           1 cgcccgaacagggacctgagaggggcgcagaccctacctgttgaacctgg 
8Z           1 cgcccgaacagggacctgagaggggcgcagaccctacctgttgaacctgg 
mutated 8Z   1 cgcccgaacagggacctgagaggggcgcagaccctacctgttgaacctgg 

4Z          51 ctgatcgtaggatccccgggacagcagaggagaacttacagaagtcttct 
8Z          51 ctgatcgtaggatccccgggacagcagaggagaacttacagaagtcttct 
mutated 8Z  51 ctgatcgtaggatccccgggacagcagaggagaacttacagaagtcttct 

4Z         101 ggaggtgttcctggccagaacacaggaggacaggtaag.at-gggagaccc 
8Z         101 ggaggtgttcctggccagaacacaggaggacaggtaag.attgggagaccc 
mutated 8Z 101 ggaggtgttcctggccagaacacaggaggacaggtaag.attgggagaccc 

4Z         150 tttgacat-ggagcaaggcgctcaagaagttagagaaggtgacggtacaa 
8Z         151 tttgacattggagcaaggcgctcaagaagttagagaaggtgacggtacaa 
mutated 8Z 151 tttgacattggagcaaggcgctcaagaagttagagaaggtgacggtacaa 

4Z         199 gggtctcagaaattaactactggtaactgtaattgggcgctaagtctagt 
8Z         201 gggtctcagaaattaactactggtaactgtaattgggcgctaagtctagt 
mutated 8Z 201 gggtctcagaaattaactactggtaactgtaattgggcgctaagtctagt 

4Z         249 agacttatttcat-gataccaactttgtaaaagaaaaggactggcagctg 
8Z         251 agacttatttcat-gataccaactttgtaaaagaaaaggactggcagctg 
mutated 8Z 251 agacttatttcattgataccaactttgtaaaagaaaaggactggcagctg 

4Z         298 agggat-gtcattccattgctggaagat-gtaactcagacgctgtcagga 
8Z         300 agggat-gtcattccattgctggaagat-gtaactcagacgctgtcagga 
mutated 8Z 301 agggattgtcattccattgctggaagattgtaactcagacgctgtcagga 

4Z         346 caagaaagagaggcctttgaaagaacat-ggtgggcaatttctgctgtaa 
8Z         348 caagaaagagaggcctttgaaagaacat-ggtgggcaatttctgctgtaa 
mutated 8Z 351 caagaaagagaggcctttgaaagaacattggtgggcaatttctgctgtaa 

4Z         395 agat-gggcctccagattaataat-gtagtagat-ggaaaggcatcattc 
8Z         397 agat-gggcctccagattaataat-gtagtagat-ggaaaggcatcattc 
mutated 8Z 401 agattgggcctccagattaataattgtagtagattggaaaggcatcattc 

4Z         442 cagctcctaagagcgaaatat-gaaaagaagactgctaataaaaagcagt 
8Z         444 cagctcctaagagcgaaatat-gaaaagaagactgctaataaaaagcagt 
mutated 8Z 451 cagctcctaagagcgaaatattgaaaagaagactgctaataaaaagcagt 

4Z         491 ctgagccctctgaagaatatct 
8Z         493 ctgagccctctgaagaatatct 
mutated 8Z 501 ctgagccctctgaagaatatct 
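
The positions at which the aligned vectors differ (each `at-g` in one row against `attg` in another) can be listed mechanically. The following is a small sketch, not part of the original disclosure; the helper name `differing_columns` is hypothetical, and the two example rows are copied from the fourth alignment block of FIG. 16 (4Z and 8Z).

```python
def differing_columns(seq_a, seq_b):
    """Return (index, char_a, char_b) for every aligned column that differs.

    Both inputs must already be aligned, i.e. gapped to the same length,
    with '-' marking a gap. Indices are 0-based within the row.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    return [(i, a, b) for i, (a, b) in enumerate(zip(seq_a, seq_b)) if a != b]


# Rows copied from FIG. 16 (fourth block): pONY4Z versus pONY8Z.
row_4z = "tttgacat-ggagcaaggcgctcaagaagttagagaaggtgacggtacaa"
row_8z = "tttgacattggagcaaggcgctcaagaagttagagaaggtgacggtacaa"

for index, base_4z, base_8z in differing_columns(row_4z, row_8z):
    print(index, base_4z, base_8z)
```

Running the helper over each block of the alignment gives the columns where a `t` has been inserted after an `at`, which is how the ATG-mutated positions show up in the figure.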
Atty. Dkt. No. 078883-0146 

DECLARATION AND POWER OF ATTORNEY 

As a below named inventor, I HEREBY DECLARE: 

THAT my residence, post office address, and citizenship are as stated below next to my 
name; 



THAT I believe I am the original, first, and sole inventor (if only one inventor is named 
below) or an original, first, and joint inventor (if plural inventors are named below or in an 
attached Declaration) of the subject matter which is claimed and for which a patent is sought 
on the invention entitled 



PRODUCER CELL FOR THE PRODUCTION OF RETROVIRAL VECTORS 



(Attorney Docket No. 078883-0146) 

the specification of which (check one) 

[ ] is attached hereto. 

[x] was filed on 10/05/2000 as United States Application Number or 
PCT International Application Number PCT/GB00/03837 (if applicable). 

THAT I do not know and do not believe that the same invention was ever known or 
used by others in the United States of America, or was patented or described in any printed 
publication in any country, before I (we) invented it; 

THAT I do not know and do not believe that the same invention was patented or 
described in any printed publication in any country, or in public use or on sale in the United 
States of America, for more than one year prior to the filing date of this United States 
application; 

THAT I do not know and do not believe that the same invention was first patented or 
made the subject of an inventor's certificate that issued in any country foreign to the United 
States of America before the filing date of this United States application if the foreign 
application was filed by me (us), or by my (our) legal representatives or assigns, more than 
twelve months (six months for design patents) prior to the filing date of this United States 
application; 

THAT I have reviewed and understand the contents of the above-identified specification, 
including the claim(s), as amended by any amendment specifically referred to above; 

THAT I believe that the above-identified specification contains a written description of 
the invention, and of the manner and process of making and using it, in such full, clear, concise, 
and exact terms as to enable any person skilled in the art to which it pertains, or with which it 
is most nearly connected, to make and use the invention, and sets forth the best mode 
contemplated by me of carrying out the invention; and 

THAT I acknowledge the duty to disclose to the U.S. Patent and Trademark Office all 
information known to me to be material to patentability as defined in Title 37, Code of Federal 
Regulations, §1.56. 
I HEREBY CLAIM foreign priority benefits under Title 35, United States Code §119(a)-(d) 
or § 365(b) of any foreign application(s) for patent or inventor's certificate, or §365(a) of any 
PCT international application which designated at least one country other than the United States 
of America, listed below and have also identified below any foreign application for patent or 
inventor's certificate or of any PCT international application having a filing date before that of 
the application on which priority is claimed. 



Prior Foreign Application Number | Country       | Foreign Filing Date | Priority Claimed? | Certified Copy Attached? 
9923558.2                        | Great Britain | 10/05/1999          | YES               | 

























I HEREBY CLAIM the benefit under Title 35, United States Code § 119(e) of any United 
States provisional application(s) listed below. 



U.S. Provisional Application Number | Filing Date 
















I HEREBY CLAIM the benefit under Title 35, United States Code, §120 of any United 
States application(s), or § 365(c) of any PCT international application designating the United 
States of America, listed below and, insofar as the subject matter of each of the claims of this 
application is not disclosed in the prior United States or PCT International application in the 
manner provided by the first paragraph of Title 35, United States Code, § 112, I acknowledge 
the duty to disclose information which is material to patentability as defined in Title 37, Code of 
Federal Regulations, § 1.56 which became available between the filing date of the prior 
application and the national or PCT international filing date of this application. 



U.S. Parent Application Number | PCT Parent Application Number | Parent Filing Date | Parent Patent Number 



















I HEREBY APPOINT the registered attorneys and agents at Customer Number 22428 





to have full power to prosecute this application and any continuations, divisions, reissues, and 
reexaminations thereof, to receive the patent, and to transact all business in the United States 
Patent and Trademark Office connected therewith. 



I request that all correspondence be directed to: 



Bernhard D. Saxe 
FOLEY & LARDNER 
Customer Number: 22428 

Telephone: (202) 672-5427 
Facsimile: (202) 672-5399 



I UNDERSTAND AND AGREE THAT the foregoing attorneys and agents appointed by me 
to prosecute this application do not personally represent me or my legal interests, but instead 
represent the interests of the legal owner(s) of the invention described in this application. 

I FURTHER DECLARE THAT all statements made herein of my own knowledge are true, 
and that all statements made on information and belief are believed to be true; and further that 
these statements were made with the knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code, and that such willful false statements may jeopardize the validity of the 
application or any patent issuing thereon. 



Name of first inventor: Jason SLINGSBY 
Residence: London, Great Britain 
Citizenship: Great Britain 
Post Office Address: 91 Lacy Road, Putney, London SW15 1NR, Great Britain 
Inventor's signature: 
Date: 



Name of second inventor: Susan Mary KINGSMAN 
Residence: Oxford, Great Britain 
Citizenship: Great Britain 
Post Office Address: c/o Oxford BioMedica (UK) Ltd. 
Medawar Centre 
Robert Robinson Avenue 
The Oxford Science Park 
Oxford 
OX4 4GA 
Great Britain 
Inventor's signature: 
Date: 
Name of third inventor: Jonathan ROHLL 
Residence: Oxford, Great Britain 
Citizenship: Great Britain 
Post Office Address: c/o Oxford BioMedica (UK) Ltd. 
Medawar Centre 
Robert Robinson Avenue 
The Oxford Science Park 
Oxford 
OX4 4GA 
Great Britain 
Inventor's signature: 
Date: 

Name of fourth inventor: Andrew SLADE 
Residence: Oxford, Great Britain 
Citizenship: Great Britain 
Post Office Address: c/o Oxford BioMedica (UK) Ltd. 
Medawar Centre 
Robert Robinson Avenue 
The Oxford Science Park 
Oxford 
OX4 4GA 
Great Britain 
Inventor's signature: 
Date: 




SEQUENCE LISTING 

<110> SLINGSBY, JASON 
KINGSMAN, SUSAN 
ROHL, JONATHAN 
SLADE, ANDREW 

<120> PRODUCER CELL FOR THE PRODUCTION OF RETROVIRAL VECTORS 

<130> 078883-0146 

<140> 10/088,076 
<141> 2002-03-20 

<150> PCT/GB00/03837 
<151> 2000-10-05 

<150> GB 9923558.2 
<151> 1999-10-05 

<160> 65 

<170> PatentIn Ver. 2.1 

<210> 1 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 1 

caaagcatgc ctgcaggaat tcg 



<210> 2 
<211> 55 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 2 

gccaaaccta caggtggggt ctttcattat aaaacccctc ataaaaaccc cacag 



<210> 3 
<211> 176 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 
<400> 3 

caaagcatgc ctgcaggaat tcgatatcaa gcttatcgat accgtcgaat tggaagagct 60 
ttaaatcctg gcacatctca tgtatcaatg cctcagtatg tttagaaaaa caagggggga 120 
actgtggggt ttttatgagg ggttttataa tgaaagaccc cacctgtagg tttggc 176 



<210> 4 
<211> 55 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 4 

ctgtggggtt tttatgaggg gttttataat gaaagacccc acctgtaggt ttggc 



<210> 5 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 5 

gaagggactc agaccgcaga atctgagtgc cccccgagtg aggggttgtg ggctct 



<210> 6 
<211> 508 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 6 

ctgtggggtt tttatgaggg gttttataat gaaagacccc acctgtaggt ttggcaagct 60 

agcttaagta acgccatttt gcaaggcatg gaaaaataca taactgagaa tagagaagtt 120 

cagatcaagg tcaggaacag atggaacagc tgaatatggg ccaaacagga tatctgtggt 180 

aagcagttcc tgccccggct cagggccaag aacagatgga acagctgaat atgggccaaa 240 

caggatatct gtggtaagca gttcctgccc cggctcaggg ccaagaacag atggtcccca 300 

gatgcggtcc agccctcagc agtttctaga gaaccatcag atgtttccag ggtgccccaa 360 

ggacctgaaa tgaccctgtg ccttatttga actaaccaat cagttcgctt ctcgcttctg 420 

ttcgcgcgct tctgctcccc gagctcaata aaagagccca caacccctca ctcggggggc 480 

actcagattc tgcggtctga gtcccttc 508 



<210> 7 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 7 

gagcgcagcg agtcagtgag cgag 24 

<210> 8 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 8 

agagcccaca acccctcact cggggggcac tcagattctg cggtctgagt cccttc 56 

<210> 9 

<211> 419 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 9 

agagcccaca acccctcact cggggggcac tcagattctg cggtctgagt cccttctctg 60 

ctgggctgaa aaggcctttg taataaatat aattctctac tcagtccctg tctctagttt 120 

gtctgttcga gatcctacag agctcatgcc ttggcgtaat catggtcata gctgtttcct 180 

gtgtgaaatt gttatccgct cacaattcca cacaacatac gagccggaag cataaagtgt 240 

aaagcctggg gtgcctaatg agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc 300 

gctttccagt cgggaaacct gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg 360 

agaggcggtt tgcgtattgg gcgctcttcc gcttcctcgc tcactgactc gctgcgctc 419 

<210> 10 
<211> 9138 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 10 

agatcttgaa taataaaatg tgtgtttgtc cgaaatacgc gttttgagat ttctgtcgcc 60 

gactaaattc atgtcgcgcg atagtggtgt ttatcgccga tagagatggc gatattggaa 120 

aaattgatat ttgaaaatat ggcatattga aaatgtcgcc gatgtgagtt tctgtgtaac 180 

tgatatcgcc atttttccaa aagtgatttt tgggcatacg cgatatctgg cgatagcgct 240 

tatatcgttt acgggggatg gcgatagacg actttggtga cttgggcgat tctgtgtgtc 300 

gcaaatatcg cagtttcgat ataggtgaca gacgatatga ggctatatcg ccgatagagg 360 

cgacatcaag ctggcacatg gccaatgcat atcgatctat acattgaatc aatattggcc 420 

attagccata ttattcattg gttatatagc ataaatcaat attggctatt ggccattgca 480 



tacgttgtat ccatatcgta atatgtacat ttatattggc tcatgtccaa cattaccgcc 540 

atgttgacat tgattattga ctagttatta atagtaatca attacggggt cattagttca 600 

tagcccatat atggagttcc gcgttacata acttacggta aatggcccgc ctggctgacc 660 

gcccaacgac ccccgcccat tgacgtcaat aatgacgtat gttcccatag taacgccaat 720 
agggactttc cattgacgtc aatgggtgga gtatttacgg taaactgccc acttggcagt 780 

acatcaagtg tatcatatgc caagtccgcc ccctattgac gtcaatgacg gtaaatggcc 840 

cgcctggcat tatgcccagt acatgacctt acgggacttt cctacttggc agtacatcta 900 

cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacacca atgggcgtgg 960 

atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca atgggagttt 1020 

gttttggcac caaaatcaac gggactttcc aaaatgtcgt aacaactgcg atcgcccgcc 1080 

ccgttgacgc aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt 1140 

ttagtgaacc gggcactcag attctgcggt ctgagtccct tctctgctgg gctgaaaagg 1200 

cctttgtaat aaatataatt ctctactcag tccctgtctc tagtttgtct gttcgagatc 1260 

ctacagttgg cgcccgaaca gggacctgag aggggcgcag accctacctg ttgaacctgg 1320 

ctgatcgtag gatccccggg acagcagagg agaacttaca gaagtcttct ggaggtgttc 1380 

ctggccagaa cacaggagga caggtaagat tgggagaccc tttgacattg gagcaaggcg 1440 

ctcaagaagt tagagaaggt gacggtacaa gggtctcaga aattaactac tggtaactgt 1500 

aattgggcgc taagtctagt agacttattt catgatacca actttgtaaa agaaaaggac 1560 

tggcagctga gggatgtcat tccattgctg gaagatgtaa ctcagacgct gtcaggacaa 1620 

gaaagagagg cctttgaaag aacatggtgg gcaatttctg ctgtaaagat gggcctccag 1680 

attaataatg tagtagatgg aaaggcatca ttccagctcc taagagcgaa atatgaaaag 1740 

aagactgcta ataaaaagca gtctgagccc tctgaagaat atctctagaa ctagtggatc 1800 

ccccgggctg caggagtggg gaggcacgat ggccgctttg gtcgaggcgg atccggccat 1860 

tagccatatt attcattggt tatatagcat aaatcaatat tggctattgg ccattgcata 1920 

cgttgtatcc atatcataat atgtacattt atattggctc atgtccaaca ttaccgccat 1980 

gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 2040 

gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 2100 

ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 2160 

ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 2220 

atcaagtgta tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg 2280 

cctggcatta tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg 2340 

tattagtcat cgctattacc atggtgatgc ggttttggca gtacatcaat gggcgtggat 2400 

agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 2460 

tttggcacca aaatcaacgg gactttccaa aatgtcgtaa caactccgcc ccattgacgc 2520 

aaatgggcgg taggcatgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 2580 

gtcagatcgc ctggagacgc catccacgct gttttgacct ccatagaaga caccgggacc 2640 

gatccagcct ccgcggcccc aagcttcagc tgctcgagga tctgcggatc cggggaattc 2700 

cccagtctca ggatccacca tgggggatcc cgtcgtttta caacgtcgtg actgggaaaa 2760 

ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca gctggcgtaa 2820 

tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga atggcgaatg 2880 

gcgctttgcc tggtttccgg caccagaagc ggtgccggaa agctggctgg agtgcgatct 2940 

tcctgaggcc gatactgtcg tcgtcccctc aaactggcag atgcacggtt acgatgcgcc 3000 

catctacacc aacgtaacct atcccattac ggtcaatccg ccgtttgttc ccacggagaa 3060 

tccgacgggt tgttactcgc tcacatttaa tgttgatgaa agctggctac aggaaggcca 3120 

gacgcgaatt atttttgatg gcgttaactc ggcgtttcat ctgtggtgca acgggcgctg 3180 

ggtcggttac ggccaggaca gtcgtttgcc gtctgaattt gacctgagcg catttttacg 3240 

cgccggagaa aaccgcctcg cggtgatggt gctgcgttgg agtgacggca gttatctgga 3300 

agatcaggat atgtggcgga tgagcggcat tttccgtgac gtctcgttgc tgcataaacc 3360 

gactacacaa atcagcgatt tccatgttgc cactcgcttt aatgatgatt tcagccgcgc 3420 

tgtactggag gctgaagttc agatgtgcgg cgagttgcgt gactacctac gggtaacagt 3480 

ttctttatgg cagggtgaaa cgcaggtcgc cagcggcacc gcgcctttcg gcggtgaaat 3540 

tatcgatgag cgtggtggtt atgccgatcg cgtcacacta cgtctgaacg tcgaaaaccc 3600 

gaaactgtgg agcgccgaaa tcccgaatct ctatcgtgcg gtggttgaac tgcacaccgc 3660 

cgacggcacg ctgattgaag cagaagcctg cgatgtcggt ttccgcgagg tgcggattga 3720 

aaatggtctg ctgctgctga acggcaagcc gttgctgatt cgaggcgtta accgtcacga 3780 

gcatcatcct ctgcatggtc aggtcatgga tgagcagacg atggtgcagg atatcctgct 3840 

gatgaagcag aacaacttta acgccgtgcg ctgttcgcat tatccgaacc atccgctgtg 3900 

gtacacgctg tgcgaccgct acggcctgta tgtggtggat gaagccaata ttgaaaccca 3960 

cggcatggtg ccaatgaatc gtctgaccga tgatccgcgc tggctaccgg cgatgagcga 4020 

acgcgtaacg cgaatggtgc agcgcgatcg taatcacccg agtgtgatca tctggtcgct 4080 

ggggaatgaa tcaggccacg gcgctaatca cgacgcgctg tatcgctgga tcaaatctgt 4140 

cgatccttcc cgcccggtgc agtatgaagg cggcggagcc gacaccacgg ccaccgatat 4200 

tatttgcccg atgtacgcgc gcgtggatga agaccagccc ttcccggctg tgccgaaatg 4260 

gtccatcaaa aaatggcttt cgctacctgg agagacgcgc ccgctgatcc tttgcgaata 4320 

cgcccacgcg atgggtaaca gtcttggcgg tttcgctaaa tactggcagg cgtttcgtca 4380 

gtatccccgt ttacagggcg gcttcgtctg ggactgggtg gatcagtcgc tgattaaata 4440 

tgatgaaaac ggcaacccgt ggtcggctta cggcggtgat tttggcgata cgccgaacga 4500 

tcgccagttc tgtatgaacg gtctggtctt tgccgaccgc acgccgcatc cagcgctgac 4560 

ggaagcaaaa caccagcagc agtttttcca gttccgttta tccgggcaaa ccatcgaagt 4620 

gaccagcgaa tacctgttcc gtcatagcga taacgagctc ctgcactgga tggtggcgct 4680 

ggatggtaag ccgctggcaa gcggtgaagt gcctctggat gtcgctccac aaggtaaaca 4740 

gttgattgaa ctgcctgaac taccgcagcc ggagagcgcc gggcaactct ggctcacagt 4800 

acgcgtagtg caaccgaacg cgaccgcatg gtcagaagcc gggcacatca gcgcctggca 4860 

gcagtggcgt ctggcggaaa acctcagtgt gacgctcccc gccgcgtccc acgccatccc 4920 

gcatctgacc accagcgaaa tggatttttg catcgagctg ggtaataagc gttggcaatt 4980 

taaccgccag tcaggctttc tttcacagat gtggattggc gataaaaaac aactgctgac 5040 

gccgctgcgc gatcagttca cccgtgcacc gctggataac gacattggcg taagtgaagc 5100 

gacccgcatt gaccctaacg cctgggtcga acgctggaag gcggcgggcc attaccaggc 5160 

cgaagcagcg ttgttgcagt gcacggcaga tacacttgct gatgcggtgc tgattacgac 5220 

cgctcacgcg tggcagcatc aggggaaaac cttatttatc agccggaaaa cctaccggat 5280 

tgatggtagt ggtcaaatgg cgattaccgt tgatgttgaa gtggcgagcg atacaccgca 5340 

tccggcgcgg attggcctga actgccagct ggcgcaggta gcagagcggg taaactggct 5400 

cggattaggg ccgcaagaaa actatcccga ccgccttact gccgcctgtt ttgaccgctg 5460 

ggatctgcca ttgtcagaca tgtatacccc gtacgtcttc ccgagcgaaa acggtctgcg 5520 

ctgcgggacg cgcgaattga attatggccc acaccagtgg cgcggcgact tccagttcaa 5580 

catcagccgc tacagtcaac agcaactgat ggaaaccagc catcgccatc tgctgcacgc 5640 

ggaagaaggc acatggctga atatcgacgg tttccatatg gggattggtg gcgacgactc 5700 

ctggagcccg tcagtatcgg cggaattcca gctgagcgcc ggtcgctacc attaccagtt 5760 

ggtctggtgt caaaaataat aataaccggg caggggggat ccgcagatcc ggctgtggaa 5820 

tgtgtgtcag ttagggtgtg gaaagtcccc aggctcccca gcaggcagaa gtatgcaaag 5880 

catgcctgca ggaattcgat atcaagctta tcgataccgt cgaattggaa gagctttaaa 5940 

tcctggcaca tctcatgtat caatgcctca gtatgtttag aaaaacaagg ggggaactgt 6000 

ggggttttta tgaggggttt tataatgaaa gaccccacct gtaggtttgg caagctagct 6060 

taagtaacgc cattttgcaa ggcatggaaa aatacataac tgagaataga gaagttcaga 6120 

tcaaggtcag gaacagatgg aacagctgaa tatgggccaa acaggatatc tgtggtaagc 6180 

agttcctgcc ccggctcagg gccaagaaca gatggaacag ctgaatatgg gccaaacagg 6240 

atatctgtgg taagcagttc ctgccccggc tcagggccaa gaacagatgg tccccagatg 6300 

cggtccagcc ctcagcagtt tctagagaac catcagatgt ttccagggtg ccccaaggac 6360 

ctgaaatgac cctgtgcctt atttgaacta accaatcagt tcgcttctcg cttctgttcg 6420 

cgcgcttctg ctccccgagc tcaataaaag agcccacaac ccctcactcg gggggcactc 6480 

agattctgcg gtctgagtcc cttctctgct gggctgaaaa ggcctttgta ataaatataa 6540 

ttctctactc agtccctgtc tctagtttgt ctgttcgaga tcctacagag ctcatgcctt 6600 

ggcgtaatca tggtcatagc tgtttcctgt gtgaaattgt tatccgctca caattccaca 6660 

caacatacga gccggaagca taaagtgtaa agcctggggt gcctaatgag tgagctaact 6720 

cacattaatt gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt cgtgccagct 6780 

gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc 6840 

ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca 6900 

ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg 6960 

agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca 7020 

taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa 7080 

cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc 7140 

tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc 7200 

gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct 7260 

gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg 7320 

tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag 7380 

gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta 7440 

cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg 7500 

aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt 7560 

tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt 7620 

ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag 7680 
attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat 7740 

ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc 7800 

tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat 7860 

aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc 7920 

acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag 7980 

aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag 8040 

agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt 8100 

ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg 8160 

agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt 8220 

tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc 8280 

tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc 8340 

attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa 8400 

taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg 8460 

aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc 8520 

caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag 8580 

gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt 8640 

cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt 8700 

tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc 8760 

acctaaattg taagcgttaa tattttgtta aaattcgcgt taaatttttg ttaaatcagc 8820 

tcatttttta accaataggc cgaaatcggc aaaatccctt ataaatcaaa agaatagacc 8880 

gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa gaacgtggac 8940 

tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg tgaaccatca 9000 

ccctaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa ccctaaaggg 9060 

agcccccgat ttagagcttg acggggaaag ccaacctggc ttatcgaaat taatacgact 9120 

cactataggg agaccggc 9138 



<210> 11 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 11 

ggccatcgtg cctccccact cctgcagtta taaaacccct cataaaaacc ccacag 56 



<210> 12 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 12 

ctgtggggtt tttatgaggg gttttataaa ctgcaggagt ggggaggcac gatggcc 57 



<210> 13 

<211> 64 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 13 

gaagggactc agaccgcaga atctgagtgc ccggttcact aaacgagctc tgcttatata 
gacc 



<210> 14 

<211> 64 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 14 

ggtctatata agcagagctc gtttagtgaa ccgggcactc agattctgcg gtctgagtcc 
cttc 



<210> 15 

<211> 28 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 



<400> 15 

cgagatccta cagttggcgc ccgaacag 



<210> 16 
<211> 98 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 16 

gagttacaat cttccagcaa tggaatgaca atccctcagc tgccagtcct tttcttttac 
aaagttggta tcaatgaaat aagtctacta gacttagc 



<210> 17 
<211> 95 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 17 

ttccattgct ggaagattgt aactcagacg ctgtcaggac aagaaagaga ggcctttgaa 
agaacattgg tgggcaattt ctgctgtaaa gattg 
<210> 18 
<211> 98 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 18 

caatatttcg ctcttaggag ctggaatgat gcctttccaa tctactacaa ttattaatct 
ggaggcccaa tctttacagc agaaattgcc caccaatg 



<210> 19 
<211> 83 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 19 

ccactagttc tagagatatt cttcagaggg ctcagactgc tttttattag cagtcttctt 
ttcaatattt cgctcttagg agc 



<210> 20 
<211> 552 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 20 

cgagatccta cagttggcgc ccgaacaggg acctgagagg ggcgcagacc ctacctgttg 60 

aacctggctg atcgtaggat ccccgggaca gcagaggaga acttacagaa gtcttctgga 120 

ggtgttcctg gccagaacac aggaggacag gtaagattgg gagacccttt gacattggag 180 

caaggcgctc aagaagttag agaaggtgac ggtacaaggg tctcagaaat taactactgg 240 

taactgtaat tgggcgctaa gtctagtaga cttatttcat tgataccaac tttgtaaaag 300 

aaaaggactg gcagctgagg gattgtcatt ccattgctgg aagattgtaa ctcagacgct 360 

gtcaggacaa gaaagagagg cctttgaaag aacattggtg ggcaatttct gctgtaaaga 420 

ttgggcctcc agattaataa ttgtagtaga ttggaaaggc atcattccag ctcctaagag 480 

cgaaatattg aaaagaagac tgctaataaa aagcagtctg agccctctga agaatatctc 540 

tagaactagt gg 552 



<210> 21 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 
<400> 21 

gataacttcg tataatgtat gctatacgaa gttatctgca 40 



<210> 22 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 22 

gataacttcg tatagcatac attatacgaa gttatctgca 



<210> 23 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 23 

gtgataactt cgtataatgt atgctatacg aagttatcac tac 



<210> 24 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 24 

gtgataactt cgtatagcat acattatacg aagttatcac gta 



<210> 25 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 25 

catgtataac ttcgtataat gtatgctata cgaagttata 



<210> 26 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 26 

catgtataac ttcgtatagc atacattata cgaagttata 



<210> 27 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 27 

agtaggccgc ctcggccgcc cgggcatca 



<210> 28 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 28 

tgatgcccgg gcggccgagg cggcctact 



<210> 29 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<400> 29 

tagccgagat ctcaaattgc ttagcctgat agcc 34 



<210> 30 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 30 

tgcgtagcta gcctcccggt ggtgggtcgg tg 



<210> 31 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 31 

agcagtagat ctggggttgg ggttgcgcct tt 



<210> 32 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 32 

cgtcatgcta gcctggggag agaggtcggt g 



<210> 33 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 33 

tacggaagat ctaaatgagt cttcggacct 



<210> 34 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 34 

ctcaacgcta gcgtactcta gccttaagag ctg 



<210> 35 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 35 

taccagagat cttctagagt cgaccaattc tcatg 



<210> 36 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 36 

catcgagcta gcagcttgga ggtgcacacc aatg 34 



<210> 37 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 37 

gatggtagat ctgcgcagca ccatggcctg aa 32 



<210> 38 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 38 

ctcgaagcta gcagcttttt gcaaaagcct aggc 34 



<210> 39 
<211> 515 
<212> DNA 
<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 39 

gaattcgcca ccatggctga gagcaaggag gccagggatc aagagatgac ctcaaggaag 
agagcaaaga ggagaagcgc cgcaacgact ggtggaagat cgacccaaag gccccctgga 
gggggaccag tggtgccgcg tgctgagaca gtccctgccc gaggagaaga ttcctagcca 
gacctgcatc gccagaagac acctcggccc cggtcccacc cagcacacac cctccagaag 
ggataggtgg attaggggcc agattttgca agccgaggtc ctccaagaaa ggctggaatg 
gagaattagg ggcgtgcaac aagccgctaa agagctggga gaggtgaatc gcggcatctg 
gagggagctc tacttccgcg aggaccagag gggcgatttc tccgcatggg gaggctacca 
gagggcacaa gaaaggctgt ggggcgagca gagcagcccc cgcgtcttga ggcccggaga 
ctccaaaaga cgccgcaaac acctgtgaag tcgac 



<210> 40 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 40 

gatcggccgc ctcggcca 



<210> 41 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 41 

gatctggccg aggcggcc 



<210> 42 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 42 

ggccgcctcg gccgta 



<210> 43 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 



<400> 43 

ggccgaggcg gcctac 16 



<210> 44 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<220> 

<221> misc_feature 
<222> (21)..(22) 

<223> The sequence of the EIAV gag/pol ORF is inserted 
between these bases 

<400> 44 

tctagagaat tcgccaccat ggaacccggg gcggccgc 



<210> 45 
<211> 10384 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 45 

agatcttgaa taataaaatg tgtgtttgtc cgaaatacgc gttttgagat ttctgtcgcc 60 

gactaaattc atgtcgcgcg atagtggtgt ttatcgccga tagagatggc gatattggaa 120 

aaattgatat ttgaaaatat ggcatattga aaatgtcgcc gatgtgagtt tctgtgtaac 180 

tgatatcgcc atttttccaa aagtgatttt tgggcatacg cgatatctgg cgatagcgct 240 

tatatcgttt acgggggatg gcgatagacg actttggtga cttgggcgat tctgtgtgtc 300 

gcaaatatcg cagtttcgat ataggtgaca gacgatatga ggctatatcg ccgatagagg 360 

cgacatcaag ctggcacatg gccaatgcat atcgatctat acattgaatc aatattggcc 420 

attagccata ttattcattg gttatatagc ataaatcaat attggctatt ggccattgca 480 

tacgttgtat ccatatcgta atatgtacat ttatattggc tcatgtccaa cattaccgcc 540 

atgttgacat tgattattga ctagttatta atagtaatca attacggggt cattagttca 600 

tagcccatat atggagttcc gcgttacata acttacggta aatggcccgc ctggctgacc 660 

gcccaacgac ccccgcccat tgacgtcaat aatgacgtat gttcccatag taacgccaat 720 

agggactttc cattgacgtc aatgggtgga gtatttacgg taaactgccc acttggcagt 780 

acatcaagtg tatcatatgc caagtccgcc ccctattgac gtcaatgacg gtaaatggcc 840 

cgcctggcat tatgcccagt acatgacctt acgggacttt cctacttggc agtacatcta 900 

cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacacca atgggcgtgg 960 

atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca atgggagttt 1020 

gttttggcac caaaatcaac gggactttcc aaaatgtcgt aacaactgcg atcgcccgcc 1080 

ccgttgacgc aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt 1140 

ttagtgaacc gggcactcag attctgcggt ctgagtccct tctctgctgg gctgaaaagg 1200 

cctttgtaat aaatataatt ctctactcag tccctgtctc tagtttgtct gttcgagatc 1260 

ctacagttgg cgcccgaaca gggacctgag aggggcgcag accctacctg ttgaacctgg 1320 

ctgatcgtag gatccccggg acagcagagg agaacttaca gaagtcttct ggaggtgttc 1380 

ctggccagaa cacaggagga caggtaagat tgggagaccc tttgacattg gagcaaggcg 1440 

ctcaagaagt tagagaaggt gacggtacaa gggtctcaga aattaactac tggtaactgt 1500 

aattgggcgc taagtctagt agacttattt catgatacca actttgtaaa agaaaaggac 1560 

tggcagctga gggatgtcat tccattgctg gaagatgtaa ctcagacgct gtcaggacaa 1620 

gaaagagagg cctttgaaag aacatggtgg gcaatttctg ctgtaaagat gggcctccag 1680 

attaataatg tagtagatgg aaaggcatca ttccagctcc taagagcgaa atatgaaaag 1740 
aagactgcta ataaaaagca gtctgagccc tctgaagaat atctctagaa ctagtggatc 1800 

ccccgggctg caggagtggg gaggcacgat ggccgctttg gtcgaggcgg atccggccat 1860 

tagccatatt attcattggt tatatagcat aaatcaatat tggctattgg ccattgcata 1920 

cgttgtatcc atatcataat atgtacattt atattggctc atgtccaaca ttaccgccat 1980 

gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 2040 

gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 2100 

ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 2160 

ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 2220 

atcaagtgta tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg 2280 

cctggcatta tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg 2340 

tattagtcat cgctattacc atggtgatgc ggttttggca gtacatcaat gggcgtggat 2400 

agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 2460 

tttggcacca aaatcaacgg gactttccaa aatgtcgtaa caactccgcc ccattgacgc 2520 

aaatgggcgg taggcatgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 2580 

gtcagatcgc ctggagacgc catccacgct gttttgacct ccatagaaga caccgggacc 2640 

gatccagcct ccgcggcccc aagcttgttg ggatccaccg gtcgccacca tggtgagcaa 2700 

gggcgaggag ctgttcaccg gggtggtgcc catcctggtc gagctggacg gcgacgtaaa 2760 

cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat gccacctacg gcaagctgac 2820 

cctgaagttc atctgcacca ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac 2880 

cctgacctac ggcgtgcagt gcttcagccg ctaccccgac cacatgaagc agcacgactt 2940 

cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc accatcttct tcaaggacga 3000 

cggcaactac aagacccgcg ccgaggtgaa gttcgagggc gacaccctgg tgaaccgcat 3060 

cgagctgaag ggcatcgact tcaaggagga cggcaacatc ctggggcaca agctggagta 3120 

caactacaac agccacaacg tctatatcat ggccgacaag cagaagaacg gcatcaaggt 3180 

gaacttcaag atccgccaca acatcgagga cggcagcgtg cagctcgccg accactacca 3240 

gcagaacacc cccatcggcg acggccccgt gctgctgccc gacaaccact acctgagcac 3300 

ccagtccgcc ctgagcaaag accccaacga gaagcgcgat cacatggtcc tgctggagtt 3360 

cgtgaccgcc gccgggatca ctctcggcat ggacgagctg tacaagtaaa gcggccgcga 3420 

ctctagagtc gacctgcagg catgcaagct tcagctgctc gagggggggc ccggtaccca 3480 

gcttttgttc cctttagtga gggttaattg cgcgggaagt atttatcact aatcaagcac 3540 
aagtaataca tgagaaactt ttactacagc aagcacaatc ctccaaaaaa ttttgttttt 3600 
acaaaatccc tggtgaacat gattggaagg gacctactag ggtgctgtgg aagggtgatg 3660 
gtgcagtagt agttaatgat gaaggaaagg gaataattgc tgtaccatta accaggacta 3720 
agttactaat aaaaccaaat tgagtattgt tgcaggaagc aagacccaac taccattgtc 3780 
agctgtgttt cctgacctca atatttgtta taaggtttga tatgaatccc agggggaatc 3840 
tcaaccccta ttacccaaca gtcagaaaaa tctaagtgtg aggagaacac aatgtttcaa 3900 
ccttattgtt ataataatga cagtaagaac agcatggcag aatcgaagga agcaagagac 3960 
caagaatgaa cctgaaagaa gaatctaaag aagaaaaaag aagaaatgac tggtggaaaa 4020 
taggtatgtt tctgttatgc ttagcaggaa ctactggagg aatactttgg tggtatgaag 4080 
gactcccaca gcaacattat atagggttgg tggcgatagg gggaagatta aacggatctg 4140 
gccaatcaaa tgctatagaa tgctggggtt ccttcccggg gtgtagacca tttcaaaatt 4200 
acttcagtta tgagaccaat agaagcatgc atatggataa taatactgct acattattag 4260 
aagctttaac caatataact gctctataaa taacaaaaca gaattagaaa catggaagtt 4320 
agtaaagact tctggcataa ctcctttacc tatttcttct gaagctaaca ctggactaat 4380 
tagacataag agagattttg gtataagtgc aatagtggca gctattgtag ccgctactgc 4440 
tattgctgct agcgctacta tgtcttatgt tgctctaact gaggttaaca aaataatgga 4500 
agtacaaaat catacttttg aggtagaaaa tagtactcta aatggtatgg atttaataga 4560 
acgacaaata aagatattat atgctatgat tcttcaaaca catgcagatg ttcaactgtt 4620 
aaaggaaaga caacaggtag aggagacatt taatttaatt ggatgtatag aaagaacaca 4680 
tgtattttgt catactggtc atccctggaa tatgtcatgg ggacatttaa atgagtcaac 4740 
acaatgggat gactgggtaa gcaaaatgga agatttaaat caagagatac taactacact 4800 
tcatggagcc aggaacaatt tggcacaatc catgataaca ttcaatacac cagatagtat 4860 
agctcaattt ggaaaagacc tttggagtca tattggaaat tggattcctg gattgggagc 4920 
ttccattata aaatatatag tgatgttttt gcttatttat ttgttactaa cctcttcgcc 4980 
taagatcctc agggccctct ggaaggtgac cagtggtgca gggtcctccg gcagtcgtta 5040 
cctgaagaaa aaattccatc acaaacatgc atcgcgagaa gacacctggg accaggccca 5100 
acacaacata cacctagcag gcgtgaccgg tggatcaggg gacaaatact acaagcagaa 5160 
gtactccagg aacgactgga atggagaatc agaggagtac aacaggcggc caaagagctg 5220 
ggtgaagtca atcgaggcat ttggagagag ctatatttcc gagaagacca aaggggagat 5280 

ttctcagcct ggggcggcta tcaacgagca caagaacggc tctgggggga acaatcctca 5340 

ccaagggtcc ttagacctgg agattcgaag cgaaggagga aacatttatg actgttgcat 5400 

taaagcccaa gaaggaactc tcgctatccc ttgctgtgga tttcccttat ggctattttg 5460 

gggactagta attatagtag gacgcatagc aggctatgga ttacgtggac tcgctgttat 5520 

aataaggatt tgtattagag gcttaaattt gatatttgaa ataatcagaa aaatgcttga 5580 

ttatattgga agagctttaa atcctggcac atctcatgta tcaatgcctc agtatgttta 5640 

gaaaaacaag gggggaactg tggggttttt atgaggggtt ttataaatga ttataagagt 5700 

aaaaagaaag ttgctgatgc tctcataacc ttgtataacc caaaggacta gctcatgttg 5760 

ctaggcaact aaaccgcaat aaccgcattt gtgacgcgag ttccccattg gtgacgcgtt 5820 

aacttcctgt ttttacagta tataagtgct tgtattctga caattgggca ctcagattct 5880 

gcggtctgag tcccttctct gctgggctga aaaggccttt gtaataaata taattctcta 5940 

ctcagtccct gtctctagtt tgtctgttcg agatcctaca gagctcatgc cttggcgtaa 6000 

tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc acacaacata 6060 

cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagtgagcta actcacatta 6120 

attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gtgatgcccg 6180 

ggcggccgag gcggcctacg tgaaccatca cccaaatcaa gttttttgcg gtcgaggtgc 6240 

cgtaaagctc taaatcggaa ccctaaaggg agcccccgat ttagagcttg acggggaaag 6300 

ccggcgaacg tggcgagaaa ggaagggaag aaagcgaaag gagcgggcgc tagggcgctg 6360 

gcaagtgtag cggtcacgct gcgcgtaacc accacacccg ccgcgcttaa tgcgccgcta 6420 

cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga tcggtgcggg 6480 

cctcttcgct attacgccag cccggatcga tccttatcgg attttaccac atttgtagag 6540 

gttttacttg ctttaaaaaa cctcccacat ctccccctga acctgaaaca taaaatgaat 6600 

gcaattgttg ttgttaactt gtttattgca gcttataatg gttacaaata aagcaatagc 6660 

atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg tttgtccaaa 6720 

ctcatcaatg tatcttatca tgtctgctcg aagcattaac cctcactaaa gggaagcggc 6780 

cgcccgggtc gacttcacag gtgtttgcgg cgtcttttgg agtctccggg cctcaagacg 6840 

cgggggctgc tctgctcgcc ccacagcctt tcttgtgccc tctggtagcc tccccatgcg 6900 

gagaaatcgc ccctctggtc ctcgcggaag tagagctccc tccagatgcc gcgattcacc 6960 

tctcccagct ctttagcggc ttgttgcacg cccctaattc tccattccag cctttcttgg 7020 

aggacctcgg cttgcaaaat ctggccccta atccacctat cccttctgga gggtgtgtgc 7080 

tgggtgggac cggggccgag gtgtcttctg gcgatgcagg tctggctagg aatcttctcc 7140 

tcgggcaggg actgtctcag cacgcggcac cactggtccc cctccagggg gccttgtggg 7200 

tcgatcttcc accagtcgtt gcggcgcttc tcctctttgc tctcttcctt gaggttcatc 7260 

tcttgatccc tggcctcctt gctctcagcc atggtggcga attctcgagg ctagcctccc 7320 

ggtggtgggt cggtggtccc tgggcagggg tctccagatc ccggacgagc ccccaaatga 7380

aagacccccg agacgggtag tcaatcactc tgaggagacc ctcccaagga acagcgagac 7440 

cacgagtcgg atgcaacagc aagaggattt attggataca cgggtacccg ggcgactcag 7500 

tctatcggag gactggcgcg ccgagtgagg ggttgtgagc tcttttatag agctcgggaa 7560 

gcagaagcgc gcgaacagaa gcgagaagca ggctgattgg ttaattcaaa taaggcacag 7620

ggtcatttca ggtccttggg ggagcctgga aacatctgat gggtcttaag aaactgctga 7680 

gggttgggcc atatctgggg accatctgtt cttggccccg ggccggggcc gaaccgcggt 7740

gaccatctgt tcttggcccc gggccggggc cgaaactgct caccgcagat atcctgtttg 7800 

gcccaacgtt agctgttttc gtgtacccgc ccttgatctg aacttctcta ttcttggttt 7860 

ggtatttttc catgccttgc aaaatggcgt tactgcggct atcaggctaa gcaatttgag 7920 

atctggccga ggcggcctac tctgcattaa tgaatcggcc aacgcgcggg gagaggcggt 7980 

ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 8040 

ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg 8100 

gataacgcag gaaagaacat gtataacttc gtataatgta tgctatacga agttatacat 8160 

gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt 8220 

ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg 8280

aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc 8340 

tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt 8400 

ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa 8460 

gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta 8520 

tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa 8580 

caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa 8640

ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt 8700 
cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt 8760

ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat 8820 

cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat 8880 

gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc 8940

aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc 9000 

acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta 9060 

gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga 9120

cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg 9180 

cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc 9240 

tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat 9300 

cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag 9360 

gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat 9420 

cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa 9480 

ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa 9540 

gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga 9600 

taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg 9660 

gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc 9720 

acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg 9780 

aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact 9840 

cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat 9900 

atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt 9960 

gccacctaaa ttgtaagcgt taatattttg ttaaaattcg cgttaaattt ttgttaaatc 10020 

agctcatttt ttaaccaata ggccgaaatc ggcaaaatcc cttataaatc aaaagaatag 10080 

accgagatag ggttgagtgt tgttccagtt tggaacaaga gtccactatt aaagaacgtg 10140

gactccaacg tcaaagggcg aaaaaccgtc tatcagggcg atggcccact acgtgataac 10200 

ttcgtataat gtatgctata cgaagttatc actacgtgaa ccatcaccct aatcaagttt 10260 

tttggggtcg aggtgccgta aagcactaaa tcggaaccct aaagggagcc cccgatttag 10320

agcttgacgg ggaaagccaa cctggcttat cgaaattaat acgactcact atagggagac 10380 

cggc 10384



<210> 46 
<211> 10384 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 



<400> 46 

agatcttgaa taataaaatg tgtgtttgtc cgaaatacgc gttttgagat ttctgtcgcc 60

gactaaattc atgtcgcgcg atagtggtgt ttatcgccga tagagatggc gatattggaa 120

aaattgatat ttgaaaatat ggcatattga aaatgtcgcc gatgtgagtt tctgtgtaac 180

tgatatcgcc atttttccaa aagtgatttt tgggcatacg cgatatctgg cgatagcgct 240

tatatcgttt acgggggatg gcgatagacg actttggtga cttgggcgat tctgtgtgtc 300

gcaaatatcg cagtttcgat ataggtgaca gacgatatga ggctatatcg ccgatagagg 360

cgacatcaag ctggcacatg gccaatgcat atcgatctat acattgaatc aatattggcc 420

attagccata ttattcattg gttatatagc ataaatcaat attggctatt ggccattgca 480

tacgttgtat ccatatcgta atatgtacat ttatattggc tcatgtccaa cattaccgcc 540

atgttgacat tgattattga ctagttatta atagtaatca attacggggt cattagttca 600

tagcccatat atggagttcc gcgttacata acttacggta aatggcccgc ctggctgacc 660

gcccaacgac ccccgcccat tgacgtcaat aatgacgtat gttcccatag taacgccaat 720

agggactttc cattgacgtc aatgggtgga gtatttacgg taaactgccc acttggcagt 780

acatcaagtg tatcatatgc caagtccgcc ccctattgac gtcaatgacg gtaaatggcc 840

cgcctggcat tatgcccagt acatgacctt acgggacttt cctacttggc agtacatcta 900

cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacacca atgggcgtgg 960

atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca atgggagttt 1020

gttttggcac caaaatcaac gggactttcc aaaatgtcgt aacaactgcg atcgcccgcc 1080

ccgttgacgc aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt 1140

ttagtgaacc gggcactcag attctgcggt ctgagtccct tctctgctgg gctgaaaagg 1200 

cctttgtaat aaatataatt ctctactcag tccctgtctc tagtttgtct gttcgagatc 1260 

ctacagttgg cgcccgaaca gggacctgag aggggcgcag accctacctg ttgaacctgg 1320 

ctgatcgtag gatccccggg acagcagagg agaacttaca gaagtcttct ggaggtgttc 1380 

ctggccagaa cacaggagga caggtaagat tgggagaccc tttgacattg gagcaaggcg 1440 

ctcaagaagt tagagaaggt gacggtacaa gggtctcaga aattaactac tggtaactgt 1500 

aattgggcgc taagtctagt agacttattt catgatacca actttgtaaa agaaaaggac 1560 

tggcagctga gggatgtcat tccattgctg gaagatgtaa ctcagacgct gtcaggacaa 1620

gaaagagagg cctttgaaag aacatggtgg gcaatttctg ctgtaaagat gggcctccag 1680 

attaataatg tagtagatgg aaaggcatca ttccagctcc taagagcgaa atatgaaaag 1740

aagactgcta ataaaaagca gtctgagccc tctgaagaat atctctagaa ctagtggatc 1800 

ccccgggctg caggagtggg gaggcacgat ggccgctttg gtcgaggcgg atccggccat 1860

tagccatatt attcattggt tatatagcat aaatcaatat tggctattgg ccattgcata 1920 

cgttgtatcc atatcataat atgtacattt atattggctc atgtccaaca ttaccgccat 1980 

gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 2040 

gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 2100 

ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 2160 

ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 2220 

atcaagtgta tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg 2280 

cctggcatta tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg 2340 

tattagtcat cgctattacc atggtgatgc ggttttggca gtacatcaat gggcgtggat 2400 

agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 2460 

tttggcacca aaatcaacgg gactttccaa aatgtcgtaa caactccgcc ccattgacgc 2520 

aaatgggcgg taggcatgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 2580

gtcagatcgc ctggagacgc catccacgct gttttgacct ccatagaaga caccgggacc 2640

gatccagcct ccgcggcccc aagcttgttg ggatccaccg gtcgccacca tggtgagcaa 2700 

gggcgaggag ctgttcaccg gggtggtgcc catcctggtc gagctggacg gcgacgtaaa 2760 

cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat gccacctacg gcaagctgac 2820

cctgaagttc atctgcacca ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac 2880 

cctgacctac ggcgtgcagt gcttcagccg ctaccccgac cacatgaagc agcacgactt 2940 

cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc accatcttct tcaaggacga 3000 

cggcaactac aagacccgcg ccgaggtgaa gttcgagggc gacaccctgg tgaaccgcat 3060 

cgagctgaag ggcatcgact tcaaggagga cggcaacatc ctggggcaca agctggagta 3120

caactacaac agccacaacg tctatatcat ggccgacaag cagaagaacg gcatcaaggt 3180 

gaacttcaag atccgccaca acatcgagga cggcagcgtg cagctcgccg accactacca 3240 

gcagaacacc cccatcggcg acggccccgt gctgctgccc gacaaccact acctgagcac 3300 

ccagtccgcc ctgagcaaag accccaacga gaagcgcgat cacatggtcc tgctggagtt 3360 

cgtgaccgcc gccgggatca ctctcggcat ggacgagctg tacaagtaaa gcggccgcga 3420

ctctagagtc gacctgcagg catgcaagct tcagctgctc gagggggggc ccggtaccca 3480 

gcttttgttc cctttagtga gggttaattg cgcgggaagt atttatcact aatcaagcac 3540 

aagtaataca tgagaaactt ttactacagc aagcacaatc ctccaaaaaa ttttgttttt 3600

acaaaatccc tggtgaacat gattggaagg gacctactag ggtgctgtgg aagggtgatg 3660 

gtgcagtagt agttaatgat gaaggaaagg gaataattgc tgtaccatta accaggacta 3720 

agttactaat aaaaccaaat tgagtattgt tgcaggaagc aagacccaac taccattgtc 3780 

agctgtgttt cctgacctca atatttgtta taaggtttga tatgaatccc agggggaatc 3840

tcaaccccta ttacccaaca gtcagaaaaa tctaagtgtg aggagaacac aatgtttcaa 3900

ccttattgtt ataataatga cagtaagaac agcatggcag aatcgaagga agcaagagac 3960 

caagaatgaa cctgaaagaa gaatctaaag aagaaaaaag aagaaatgac tggtggaaaa 4020

taggtatgtt tctgttatgc ttagcaggaa ctactggagg aatactttgg tggtatgaag 4080 

gactcccaca gcaacattat atagggttgg tggcgatagg gggaagatta aacggatctg 4140 

gccaatcaaa tgctatagaa tgctggggtt ccttcccggg gtgtagacca tttcaaaatt 4200 

acttcagtta tgagaccaat agaagcatgc atatggataa taatactgct acattattag 4260 

aagctttaac caatataact gctctataaa taacaaaaca gaattagaaa catggaagtt 4320 

agtaaagact tctggcataa ctcctttacc tatttcttct gaagctaaca ctggactaat 4380 

tagacataag agagattttg gtataagtgc aatagtggca gctattgtag ccgctactgc 4440 

tattgctgct agcgctacta tgtcttatgt tgctctaact gaggttaaca aaataatgga 4500 
agtacaaaat catacttttg aggtagaaaa tagtactcta aatggtatgg atttaataga 4560

acgacaaata aagatattat atgctatgat tcttcaaaca catgcagatg ttcaactgtt 4620 

aaaggaaaga caacaggtag aggagacatt taatttaatt ggatgtatag aaagaacaca 4680 

tgtattttgt catactggtc atccctggaa tatgtcatgg ggacatttaa atgagtcaac 4740 

acaatgggat gactgggtaa gcaaaatgga agatttaaat caagagatac taactacact 4800 

tcatggagcc aggaacaatt tggcacaatc catgataaca ttcaatacac cagatagtat 4860 

agctcaattt ggaaaagacc tttggagtca tattggaaat tggattcctg gattgggagc 4920

ttccattata aaatatatag tgatgttttt gcttatttat ttgttactaa cctcttcgcc 4980 

taagatcctc agggccctct ggaaggtgac cagtggtgca gggtcctccg gcagtcgtta 5040 

cctgaagaaa aaattccatc acaaacatgc atcgcgagaa gacacctggg accaggccca 5100 

acacaacata cacctagcag gcgtgaccgg tggatcaggg gacaaatact acaagcagaa 5160 

gtactccagg aacgactgga atggagaatc agaggagtac aacaggcggc caaagagctg 5220 

ggtgaagtca atcgaggcat ttggagagag ctatatttcc gagaagacca aaggggagat 5280

ttctcagcct ggggcggcta tcaacgagca caagaacggc tctgggggga acaatcctca 5340 

ccaagggtcc ttagacctgg agattcgaag cgaaggagga aacatttatg actgttgcat 5400 

taaagcccaa gaaggaactc tcgctatccc ttgctgtgga tttcccttat ggctattttg 5460 

gggactagta attatagtag gacgcatagc aggctatgga ttacgtggac tcgctgttat 5520 

aataaggatt tgtattagag gcttaaattt gatatttgaa ataatcagaa aaatgcttga 5580 

ttatattgga agagctttaa atcctggcac atctcatgta tcaatgcctc agtatgttta 5640 

gaaaaacaag gggggaactg tggggttttt atgaggggtt ttataaatga ttataagagt 5700 

aaaaagaaag ttgctgatgc tctcataacc ttgtataacc caaaggacta gctcatgttg 5760 

ctaggcaact aaaccgcaat aaccgcattt gtgacgcgag ttccccattg gtgacgcgtt 5820 

aacttcctgt ttttacagta tataagtgct tgtattctga caattgggca ctcagattct 5880 

gcggtctgag tcccttctct gctgggctga aaaggccttt gtaataaata taattctcta 5940 

ctcagtccct gtctctagtt tgtctgttcg agatcctaca gagctcatgc cttggcgtaa 6000 

tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc acacaacata 6060 

cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagtgagcta actcacatta 6120 

attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gagtaggccg 6180 

cctcggccag atctcaaatt gcttagcctg atagccgcag taacgccatt ttgcaaggca 6240 

tggaaaaata ccaaaccaag aatagagaag ttcagatcaa gggcgggtac acgaaaacag 6300 

ctaacgttgg gccaaacagg atatctgcgg tgagcagttt cggccccggc ccggggccaa 6360 

gaacagatgg tcaccgcggt tcggccccgg cccggggcca agaacagatg gtccccagat 6420

atggcccaac cctcagcagt ttcttaagac ccatcagatg tttccaggct cccccaagga 6480 

cctgaaatga ccctgtgcct tatttgaatt aaccaatcag cctgcttctc gcttctgttc 6540 

gcgcgcttct gcttcccgag ctctataaaa gagctcacaa cccctcactc ggcgcgccag 6600 

tcctccgata gactgagtcg cccgggtacc cgtgtatcca ataaatcctc ttgctgttgc 6660 

atccgactcg tggtctcgct gttccttggg agggtctcct cagagtgatt gactacccgt 6720 

ctcgggggtc tttcatttgg gggctcgtcc gggatctgga gacccctgcc cagggaccac 6780 

cgacccacca ccgggaggct agcctcgaga attcgccacc atggctgaga gcaaggaggc 6840 

cagggatcaa gagatgaacc tcaaggaaga gagcaaagag gagaagcgcc gcaacgactg 6900 

gtggaagatc gacccacaag gccccctgga gggggaccag tggtgccgcg tgctgagaca 6960 

gtccctgccc gaggagaaga ttcctagcca gacctgcatc gccagaagac acctcggccc 7020 

cggtcccacc cagcacacac cctccagaag ggataggtgg attaggggcc agattttgca 7080 

agccgaggtc ctccaagaaa ggctggaatg gagaattagg ggcgtgcaac aagccgctaa 7140 

agagctggga gaggtgaatc gcggcatctg gagggagctc tacttccgcg aggaccagag 7200 

gggcgatttc tccgcatggg gaggctacca gagggcacaa gaaaggctgt ggggcgagca 7260

gagcagcccc cgcgtcttga ggcccggaga ctccaaaaga cgccgcaaac acctgtgaag 7320 

tcgacccggg cggccgcttc cctttagtga gggttaatgc ttcgagcaga catgataaga 7380 

tacattgatg agtttggaca aaccacaact agaatgcagt gaaaaaaatg ctttatttgt 7440

gaaatttgtg atgctattgc tttatttgta accattataa gctgcaataa acaagttaac 7500 

aacaacaatt gcattcattt tatgtttcag gttcaggggg agatgtggga ggttttttaa 7560 

agcaagtaaa acctctacaa atgtggtaaa atccgataag gatcgatccg ggctggcgta 7620 

atagcgaaga ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat 7680 

ggacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac 7740 

cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc 7800 

cacgttcgcc ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt 7860 

tagagcttta cggcacctcg accgcaaaaa acttgatttg ggtgatggtt cacgtaggcc 7920 

gcctcggccg cccgggcatc actgcattaa tgaatcggcc aacgcgcggg gagaggcggt 7980
ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 8040

ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg 8100

gataacgcag gaaagaacat gtataacttc gtataatgta tgctatacga agttatacat 8160

gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt 8220

ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg 8280

aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc 8340

tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt 8400

ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa 8460

gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta 8520

tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa 8580

caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa 8640

ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt 8700

cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt 8760

ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat 8820

cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat 8880

gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc 8940

aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc 9000

acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta 9060

gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga 9120

cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg 9180

cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc 9240

tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat 9300

cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag 9360

gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat 9420

cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa 9480

ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa 9540

gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga 9600

taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg 9660

gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc 9720

acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg 9780

aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact 9840

cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat 9900

atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt 9960

gccacctaaa ttgtaagcgt taatattttg ttaaaattcg cgttaaattt ttgttaaatc 10020

agctcatttt ttaaccaata ggccgaaatc ggcaaaatcc cttataaatc aaaagaatag 10080

accgagatag ggttgagtgt tgttccagtt tggaacaaga gtccactatt aaagaacgtg 10140

gactccaacg tcaaagggcg aaaaaccgtc tatcagggcg atggcccact acgtgataac 10200

ttcgtataat gtatgctata cgaagttatc actacgtgaa ccatcaccct aatcaagttt 10260

tttggggtcg aggtgccgta aagcactaaa tcggaaccct aaagggagcc cccgatttag 10320

agcttgacgg ggaaagccaa cctggcttat cgaaattaat acgactcact atagggagac 10380

cggc 10384



<210> 47 
<211> 10292 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic
oligonucleotide 

<400> 47 

agatcttgaa taataaaatg tgtgtttgtc cgaaatacgc gttttgagat ttctgtcgcc 60

gactaaattc atgtcgcgcg atagtggtgt ttatcgccga tagagatggc gatattggaa 120

aaattgatat ttgaaaatat ggcatattga aaatgtcgcc gatgtgagtt tctgtgtaac 180

tgatatcgcc atttttccaa aagtgatttt tgggcatacg cgatatctgg cgatagcgct 240

tatatcgttt acgggggatg gcgatagacg actttggtga cttgggcgat tctgtgtgtc 300

gcaaatatcg cagtttcgat ataggtgaca gacgatatga ggctatatcg ccgatagagg 360

cgacatcaag ctggcacatg gccaatgcat atcgatctat acattgaatc aatattggcc 420 

attagccata ttattcattg gttatatagc ataaatcaat attggctatt ggccattgca 480 

tacgttgtat ccatatcgta atatgtacat ttatattggc tcatgtccaa cattaccgcc 540 

atgttgacat tgattattga ctagttatta atagtaatca attacggggt cattagttca 600 

tagcccatat atggagttcc gcgttacata acttacggta aatggcccgc ctggctgacc 660 

gcccaacgac ccccgcccat tgacgtcaat aatgacgtat gttcccatag taacgccaat 720 

agggactttc cattgacgtc aatgggtgga gtatttacgg taaactgccc acttggcagt 780 

acatcaagtg tatcatatgc caagtccgcc ccctattgac gtcaatgacg gtaaatggcc 840 

cgcctggcat tatgcccagt acatgacctt acgggacttt cctacttggc agtacatcta 900 

cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacacca atgggcgtgg 960 

atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca atgggagttt 1020 

gttttggcac caaaatcaac gggactttcc aaaatgtcgt aacaactgcg atcgcccgcc 1080 

ccgttgacgc aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt 1140

ttagtgaacc gggcactcag attctgcggt ctgagtccct tctctgctgg gctgaaaagg 1200 

cctttgtaat aaatataatt ctctactcag tccctgtctc tagtttgtct gttcgagatc 1260 

ctacagttgg cgcccgaaca gggacctgag aggggcgcag accctacctg ttgaacctgg 1320

ctgatcgtag gatccccggg acagcagagg agaacttaca gaagtcttct ggaggtgttc 1380 

ctggccagaa cacaggagga caggtaagat tgggagaccc tttgacattg gagcaaggcg 1440

ctcaagaagt tagagaaggt gacggtacaa gggtctcaga aattaactac tggtaactgt 1500 

aattgggcgc taagtctagt agacttattt catgatacca actttgtaaa agaaaaggac 1560 

tggcagctga gggatgtcat tccattgctg gaagatgtaa ctcagacgct gtcaggacaa 1620

gaaagagagg cctttgaaag aacatggtgg gcaatttctg ctgtaaagat gggcctccag 1680 

attaataatg tagtagatgg aaaggcatca ttccagctcc taagagcgaa atatgaaaag 1740 

aagactgcta ataaaaagca gtctgagccc tctgaagaat atctctagaa ctagtggatc 1800 

ccccgggctg caggagtggg gaggcacgat ggccgctttg gtcgaggcgg atccggccat 1860 

tagccatatt attcattggt tatatagcat aaatcaatat tggctattgg ccattgcata 1920 

cgttgtatcc atatcataat atgtacattt atattggctc atgtccaaca ttaccgccat 1980 

gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 2040 

gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 2100 

ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 2160 

ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 2220 

atcaagtgta tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg 2280 

cctggcatta tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg 2340 

tattagtcat cgctattacc atggtgatgc ggttttggca gtacatcaat gggcgtggat 2400 

agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 2460 

tttggcacca aaatcaacgg gactttccaa aatgtcgtaa caactccgcc ccattgacgc 2520 

aaatgggcgg taggcatgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 2580 

gtcagatcgc ctggagacgc catccacgct gttttgacct ccatagaaga caccgggacc 2640

gatccagcct ccgcggcccc aagcttgttg ggatccaccg gtcgccacca tggtgagcaa 2700 

gggcgaggag ctgttcaccg gggtggtgcc catcctggtc gagctggacg gcgacgtaaa 2760 

cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat gccacctacg gcaagctgac 2820

cctgaagttc atctgcacca ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac 2880

cctgacctac ggcgtgcagt gcttcagccg ctaccccgac cacatgaagc agcacgactt 2940 

cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc accatcttct tcaaggacga 3000 

cggcaactac aagacccgcg ccgaggtgaa gttcgagggc gacaccctgg tgaaccgcat 3060 

cgagctgaag ggcatcgact tcaaggagga cggcaacatc ctggggcaca agctggagta 3120

caactacaac agccacaacg tctatatcat ggccgacaag cagaagaacg gcatcaaggt 3180 

gaacttcaag atccgccaca acatcgagga cggcagcgtg cagctcgccg accactacca 3240 

gcagaacacc cccatcggcg acggccccgt gctgctgccc gacaaccact acctgagcac 3300 

ccagtccgcc ctgagcaaag accccaacga gaagcgcgat cacatggtcc tgctggagtt 3360

cgtgaccgcc gccgggatca ctctcggcat ggacgagctg tacaagtaaa gcggccgcga 3420 

ctctagagtc gacctgcagg catgcaagct tcagctgctc gagggggggc ccggtaccca 3480 

gcttttgttc cctttagtga gggttaattg cgcgggaagt atttatcact aatcaagcac 3540 

aagtaataca tgagaaactt ttactacagc aagcacaatc ctccaaaaaa ttttgttttt 3600 

acaaaatccc tggtgaacat gattggaagg gacctactag ggtgctgtgg aagggtgatg 3660

gtgcagtagt agttaatgat gaaggaaagg gaataattgc tgtaccatta accaggacta 3720 

agttactaat aaaaccaaat tgagtattgt tgcaggaagc aagacccaac taccattgtc 3780 
agctgtgttt cctgacctca atatttgtta taaggtttga tatgaatccc agggggaatc 3840

tcaaccccta ttacccaaca gtcagaaaaa tctaagtgtg aggagaacac aatgtttcaa 3900

ccttattgtt ataataatga cagtaagaac agcatggcag aatcgaagga agcaagagac 3960 

caagaatgaa cctgaaagaa gaatctaaag aagaaaaaag aagaaatgac tggtggaaaa 4020

taggtatgtt tctgttatgc ttagcaggaa ctactggagg aatactttgg tggtatgaag 4080 

gactcccaca gcaacattat atagggttgg tggcgatagg gggaagatta aacggatctg 4140

gccaatcaaa tgctatagaa tgctggggtt ccttcccggg gtgtagacca tttcaaaatt 4200 

acttcagtta tgagaccaat agaagcatgc atatggataa taatactgct acattattag 4260 

aagctttaac caatataact gctctataaa taacaaaaca gaattagaaa catggaagtt 4320 

agtaaagact tctggcataa ctcctttacc tatttcttct gaagctaaca ctggactaat 4380 

tagacataag agagattttg gtataagtgc aatagtggca gctattgtag ccgctactgc 4440 

tattgctgct agcgctacta tgtcttatgt tgctctaact gaggttaaca aaataatgga 4500 

agtacaaaat catacttttg aggtagaaaa tagtactcta aatggtatgg atttaataga 4560 

acgacaaata aagatattat atgctatgat tcttcaaaca catgcagatg ttcaactgtt 4620 

aaaggaaaga caacaggtag aggagacatt taatttaatt ggatgtatag aaagaacaca 4680 

tgtattttgt catactggtc atccctggaa tatgtcatgg ggacatttaa atgagtcaac 4740 

acaatgggat gactgggtaa gcaaaatgga agatttaaat caagagatac taactacact 4800 

tcatggagcc aggaacaatt tggcacaatc catgataaca ttcaatacac cagatagtat 4860 

agctcaattt ggaaaagacc tttggagtca tattggaaat tggattcctg gattgggagc 4920

ttccattata aaatatatag tgatgttttt gcttatttat ttgttactaa cctcttcgcc 4980 

taagatcctc agggccctct ggaaggtgac cagtggtgca gggtcctccg gcagtcgtta 5040 

cctgaagaaa aaattccatc acaaacatgc atcgcgagaa gacacctggg accaggccca 5100 

acacaacata cacctagcag gcgtgaccgg tggatcaggg gacaaatact acaagcagaa 5160 

gtactccagg aacgactgga atggagaatc agaggagtac aacaggcggc caaagagctg 5220 

ggtgaagtca atcgaggcat ttggagagag ctatatttcc gagaagacca aaggggagat 5280 

ttctcagcct ggggcggcta tcaacgagca caagaacggc tctgggggga acaatcctca 5340 

ccaagggtcc ttagacctgg agattcgaag cgaaggagga aacatttatg actgttgcat 5400 

taaagcccaa gaaggaactc tcgctatccc ttgctgtgga tttcccttat ggctattttg 5460 

gggactagta attatagtag gacgcatagc aggctatgga ttacgtggac tcgctgttat 5520

aataaggatt tgtattagag gcttaaattt gatatttgaa ataatcagaa aaatgcttga 5580 

ttatattgga agagctttaa atcctggcac atctcatgta tcaatgcctc agtatgttta 5640 

gaaaaacaag gggggaactg tggggttttt atgaggggtt ttataaatga ttataagagt 5700 

aaaaagaaag ttgctgatgc tctcataacc ttgtataacc caaaggacta gctcatgttg 5760 

ctaggcaact aaaccgcaat aaccgcattt gtgacgcgag ttccccattg gtgacgcgtt 5820 

aacttcctgt ttttacagta tataagtgct tgtattctga caattgggca ctcagattct 5880 

gcggtctgag tcccttctct gctgggctga aaaggccttt gtaataaata taattctcta 5940 

ctcagtccct gtctctagtt tgtctgttcg agatcctaca gagctcatgc cttggcgtaa 6000 

tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc acacaacata 6060 

cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagtgagcta actcacatta 6120 

attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gtgatgcccg 6180 

ggcggccgag gcggcctacg tgaaccatca cccaaatcaa gttttttgcg gtcgaggtgc 6240

cgtaaagctc taaatcggaa ccctaaaggg agcccccgat ttagagcttg acggggaaag 6300 

ccggcgaacg tggcgagaaa ggaagggaag aaagcgaaag gagcgggcgc tagggcgctg 6360

gcaagtgtag cggtcacgct gcgcgtaacc accacacccg ccgcgcttaa tgcgccgcta 6420 

cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga tcggtgcggg 6480 

cctcttcgct attacgccag cccggatcga tccttatcgg attttaccac atttgtagag 6540 

gttttacttg ctttaaaaaa cctcccacat ctccccctga acctgaaaca taaaatgaat 6600 
gcaattgttg ttgttaactt gtttattgca gcttataatg gttacaaata aagcaatagc 6660 
atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg tttgtccaaa 6720 
ctcatcaatg tatcttatca tgtctgctcg aagcattaac cctcactaaa gggaagcggc 6780 
cgcccgggtc gacttcacag gtgtttgcgg cgtcttttgg agtctccggg cctcaagacg 6840 
cgggggctgc tctgctcgcc ccacagcctt tcttgtgccc tctggtagcc tccccatgcg 6900 
gagaaatcgc ccctctggtc ctcgcggaag tagagctccc tccagatgcc gcgattcacc 6960 
tctcccagct ctttagcggc ttgttgcacg cccctaattc tccattccag cctttcttgg 7020 
aggacctcgg cttgcaaaat ctggccccta atccacctat cccttctgga gggtgtgtgc 7080 
tgggtgggac cggggccgag gtgtcttctg gcgatgcagg tctggctagg aatcttctcc 7140 
tcgggcaggg actgtctcag cacgcggcac cactggtccc cctccagggg gccttgtggg 7200 
tcgatcttcc accagtcgtt gcggcgcttc tcctctttgc tctcttcctt gaggttcatc 7260 
tcttgatccc tggcctcctt gctctcagcc atggtggcga attctcgagg ctagcctggg 7320

gagagaggtc ggtgattcgg tcaacgaggg agccgactgc cgacgtgcgc tccggaggct 7380

tgcagaatgc ggaacaccgc gcgggcagga acagggccca cactaccgcc ccacaccccg 7440 

cctcccgcac cgccccttcc cggccgctgc tctcggcgcg ccccgctgag cagccgctat 7500 

tggccacagc ccatcgcggt cggcgcgctg ccattgctcc ctggcgctgt ccgtctgcga 7560 

gggtactagt gagacgtgcg gcttccgttt gtcacgtccg gcacgccgcg aaccgcaagg 7620

aaccttcccg acttaggggc ggagcaggaa gcgtcgccgg ggggcccaca agggtagcgg 7680 

cgaagatccg ggtgacgctg cgaacggacg tgaagaatgt gcgagaccca gggtcggcgc 7740

cgctgcgttt cccggaacca cgcccagagc agccgcgtcc ctgcgcaaac ccagggctgc 7800 

cttggaaaag gcgcaacccc aaccccagat ctggccgagg cggcctactc tgcattaatg 7860 

aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct 7920 

cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 7980 

ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt ataacttcgt 8040 

ataatgtatg ctatacgaag ttatacatgt gagcaaaagg ccagcaaaag gccaggaacc 8100 

gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca 8160 

aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt 8220 

ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc 8280 

tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc 8340 

tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc 8400 

ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact 8460 

tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg 8520 

ctacagagtt cttgaagtgg tggcctaact acggctacac tagaaggaca gtatttggta 8580 

tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca 8640 

aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa 8700 

aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg 8760 

aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc 8820 

ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg 8880 

acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat 8940 

ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg 9000 

gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa 9060 

taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca 9120

tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc 9180 

gcaacgttgt tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt 9240 

cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa 9300

aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat 9360 

cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct 9420 

tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga 9480 

gttgctcttg cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag 9540 

tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga 9600 

gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca 9660 

ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg 9720

cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc 9780 

agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag 9840 

gggttccgcg cacatttccc cgaaaagtgc cacctaaatt gtaagcgtta atattttgtt 9900 

aaaattcgcg ttaaattttt gttaaatcag ctcatttttt aaccaatagg ccgaaatcgg 9960 

caaaatccct tataaatcaa aagaatagac cgagataggg ttgagtgttg ttccagtttg 10020 

gaacaagagt ccactattaa agaacgtgga ctccaacgtc aaagggcgaa aaaccgtcta 10080 

tcagggcgat ggcccactac gtgataactt cgtataatgt atgctatacg aagttatcac 10140 

tacgtgaacc atcaccctaa tcaagttttt tggggtcgag gtgccgtaaa gcactaaatc 10200 

ggaaccctaa agggagcccc cgatttagag cttgacgggg aaagccaacc tggcttatcg 10260 

aaattaatac gactcactat agggagaccg gc 10292 



<210> 48 
<211> 10292 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 48 

agatcttgaa taataaaatg tgtgtttgtc cgaaatacgc gttttgagat ttctgtcgcc 60

gactaaattc atgtcgcgcg atagtggtgt ttatcgccga tagagatggc gatattggaa 120

aaattgatat ttgaaaatat ggcatattga aaatgtcgcc gatgtgagtt tctgtgtaac 180

tgatatcgcc atttttccaa aagtgatttt tgggcatacg cgatatctgg cgatagcgct 240

tatatcgttt acgggggatg gcgatagacg actttggtga cttgggcgat tctgtgtgtc 300 

gcaaatatcg cagtttcgat ataggtgaca gacgatatga ggctatatcg ccgatagagg 360 

cgacatcaag ctggcacatg gccaatgcat atcgatctat acattgaatc aatattggcc 420

attagccata ttattcattg gttatatagc ataaatcaat attggctatt ggccattgca 480 

tacgttgtat ccatatcgta atatgtacat ttatattggc tcatgtccaa cattaccgcc 540 

atgttgacat tgattattga ctagttatta atagtaatca attacggggt cattagttca 600 

tagcccatat atggagttcc gcgttacata acttacggta aatggcccgc ctggctgacc 660 

gcccaacgac ccccgcccat tgacgtcaat aatgacgtat gttcccatag taacgccaat 720

agggactttc cattgacgtc aatgggtgga gtatttacgg taaactgccc acttggcagt 780

acatcaagtg tatcatatgc caagtccgcc ccctattgac gtcaatgacg gtaaatggcc 840

cgcctggcat tatgcccagt acatgacctt acgggacttt cctacttggc agtacatcta 900

cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacacca atgggcgtgg 960 

atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca atgggagttt 1020 

gttttggcac caaaatcaac gggactttcc aaaatgtcgt aacaactgcg atcgcccgcc 1080 

ccgttgacgc aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt 1140 

ttagtgaacc gggcactcag attctgcggt ctgagtccct tctctgctgg gctgaaaagg 1200 

cctttgtaat aaatataatt ctctactcag tccctgtctc tagtttgtct gttcgagatc 1260 

ctacagttgg cgcccgaaca gggacctgag aggggcgcag accctacctg ttgaacctgg 1320

ctgatcgtag gatccccggg acagcagagg agaacttaca gaagtcttct ggaggtgttc 1380

ctggccagaa cacaggagga caggtaagat tgggagaccc tttgacattg gagcaaggcg 1440

ctcaagaagt tagagaaggt gacggtacaa gggtctcaga aattaactac tggtaactgt 1500

aattgggcgc taagtctagt agacttattt catgatacca actttgtaaa agaaaaggac 1560 

tggcagctga gggatgtcat tccattgctg gaagatgtaa ctcagacgct gtcaggacaa 1620

gaaagagagg cctttgaaag aacatggtgg gcaatttctg ctgtaaagat gggcctccag 1680 

attaataatg tagtagatgg aaaggcatca ttccagctcc taagagcgaa atatgaaaag 1740 

aagactgcta ataaaaagca gtctgagccc tctgaagaat atctctagaa ctagtggatc 1800 

ccccgggctg caggagtggg gaggcacgat ggccgctttg gtcgaggcgg atccggccat 1860 

tagccatatt attcattggt tatatagcat aaatcaatat tggctattgg ccattgcata 1920 

cgttgtatcc atatcataat atgtacattt atattggctc atgtccaaca ttaccgccat 1980 

gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 2040 

gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 2100 

ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 2160 

ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 2220 

atcaagtgta tcatatgcca agtacgcccc ctattgacgt caatgacggt aaatggcccg 2280 

cctggcatta tgcccagtac atgaccttat gggactttcc tacttggcag tacatctacg 2340 

tattagtcat cgctattacc atggtgatgc ggttttggca gtacatcaat gggcgtggat 2400 

agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 2460 

tttggcacca aaatcaacgg gactttccaa aatgtcgtaa caactccgcc ccattgacgc 2520 

aaatgggcgg taggcatgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 2580 

gtcagatcgc ctggagacgc catccacgct gttttgacct ccatagaaga caccgggacc 2640 

gatccagcct ccgcggcccc aagcttgttg ggatccaccg gtcgccacca tggtgagcaa 2700 

gggcgaggag ctgttcaccg gggtggtgcc catcctggtc gagctggacg gcgacgtaaa 2760 

cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat gccacctacg gcaagctgac 2820

cctgaagttc atctgcacca ccggcaagct gcccgtgccc tggcccaccc tcgtgaccac 2880 

cctgacctac ggcgtgcagt gcttcagccg ctaccccgac cacatgaagc agcacgactt 2940 

cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc accatcttct tcaaggacga 3000

cggcaactac aagacccgcg ccgaggtgaa gttcgagggc gacaccctgg tgaaccgcat 3060 

cgagctgaag ggcatcgact tcaaggagga cggcaacatc ctggggcaca agctggagta 3120 

caactacaac agccacaacg tctatatcat ggccgacaag cagaagaacg gcatcaaggt 3180 



gaacttcaag atccgccaca acatcgagga cggcagcgtg cagctcgccg accactacca 3240

gcagaacacc cccatcggcg acggccccgt gctgctgccc gacaaccact acctgagcac 3300 

ccagtccgcc ctgagcaaag accccaacga gaagcgcgat cacatggtcc tgctggagtt 3360 

cgtgaccgcc gccgggatca ctctcggcat ggacgagctg tacaagtaaa gcggccgcga 3420 

ctctagagtc gacctgcagg catgcaagct tcagctgctc gagggggggc ccggtaccca 3480 

gcttttgttc cctttagtga gggttaattg cgcgggaagt atttatcact aatcaagcac 3540

aagtaataca tgagaaactt ttactacagc aagcacaatc ctccaaaaaa ttttgttttt 3600 

acaaaatccc tggtgaacat gattggaagg gacctactag ggtgctgtgg aagggtgatg 3660 

gtgcagtagt agttaatgat gaaggaaagg gaataattgc tgtaccatta accaggacta 3720 

agttactaat aaaaccaaat tgagtattgt tgcaggaagc aagacccaac taccattgtc 3780 

agctgtgttt cctgacctca atatttgtta taaggtttga tatgaatccc agggggaatc 3840

tcaaccccta ttacccaaca gtcagaaaaa tctaagtgtg aggagaacac aatgtttcaa 3900 

ccttattgtt ataataatga cagtaagaac agcatggcag aatcgaagga agcaagagac 3960

caagaatgaa cctgaaagaa gaatctaaag aagaaaaaag aagaaatgac tggtggaaaa 4020

taggtatgtt tctgttatgc ttagcaggaa ctactggagg aatactttgg tggtatgaag 4080 

gactcccaca gcaacattat atagggttgg tggcgatagg gggaagatta aacggatctg 4140 

gccaatcaaa tgctatagaa tgctggggtt ccttcccggg gtgtagacca tttcaaaatt 4200 

acttcagtta tgagaccaat agaagcatgc atatggataa taatactgct acattattag 4260 

aagctttaac caatataact gctctataaa taacaaaaca gaattagaaa catggaagtt 4320 

agtaaagact tctggcataa ctcctttacc tatttcttct gaagctaaca ctggactaat 4380

tagacataag agagattttg gtataagtgc aatagtggca gctattgtag ccgctactgc 4440 

tattgctgct agcgctacta tgtcttatgt tgctctaact gaggttaaca aaataatgga 4500 

agtacaaaat catacttttg aggtagaaaa tagtactcta aatggtatgg atttaataga 4560 

acgacaaata aagatattat atgctatgat tcttcaaaca catgcagatg ttcaactgtt 4620 

aaaggaaaga caacaggtag aggagacatt taatttaatt ggatgtatag aaagaacaca 4680 

tgtattttgt catactggtc atccctggaa tatgtcatgg ggacatttaa atgagtcaac 4740 

acaatgggat gactgggtaa gcaaaatgga agatttaaat caagagatac taactacact 4800 

tcatggagcc aggaacaatt tggcacaatc catgataaca ttcaatacac cagatagtat 4860 

agctcaattt ggaaaagacc tttggagtca tattggaaat tggattcctg gattgggagc 4920

ttccattata aaatatatag tgatgttttt gcttatttat ttgttactaa cctcttcgcc 4980 

taagatcctc agggccctct ggaaggtgac cagtggtgca gggtcctccg gcagtcgtta 5040 

cctgaagaaa aaattccatc acaaacatgc atcgcgagaa gacacctggg accaggccca 5100 

acacaacata cacctagcag gcgtgaccgg tggatcaggg gacaaatact acaagcagaa 5160 

gtactccagg aacgactgga atggagaatc agaggagtac aacaggcggc caaagagctg 5220 

ggtgaagtca atcgaggcat ttggagagag ctatatttcc gagaagacca aaggggagat 5280 

ttctcagcct ggggcggcta tcaacgagca caagaacggc tctgggggga acaatcctca 5340 

ccaagggtcc ttagacctgg agattcgaag cgaaggagga aacatttatg actgttgcat 5400 

taaagcccaa gaaggaactc tcgctatccc ttgctgtgga tttcccttat ggctattttg 5460 

gggactagta attatagtag gacgcatagc aggctatgga ttacgtggac tcgctgttat 5520

aataaggatt tgtattagag gcttaaattt gatatttgaa ataatcagaa aaatgcttga 5580 

ttatattgga agagctttaa atcctggcac atctcatgta tcaatgcctc agtatgttta 5640 

gaaaaacaag gggggaactg tggggttttt atgaggggtt ttataaatga ttataagagt 5700 

aaaaagaaag ttgctgatgc tctcataacc ttgtataacc caaaggacta gctcatgttg 5760 

ctaggcaact aaaccgcaat aaccgcattt gtgacgcgag ttccccattg gtgacgcgtt 5820 

aacttcctgt ttttacagta tataagtgct tgtattctga caattgggca ctcagattct 5880 

gcggtctgag tcccttctct gctgggctga aaaggccttt gtaataaata taattctcta 5940 

ctcagtccct gtctctagtt tgtctgttcg agatcctaca gagctcatgc cttggcgtaa 6000 

tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc acacaacata 6060 

cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagtgagcta actcacatta 6120 

attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gagtaggccg 6180 

cctcggccag atctggggtt ggggttgcgc cttttccaag gcagccctgg gtttgcgcag 6240 

ggacgcggct gctctgggcg tggttccggg aaacgcagcg gcgccgaccc tgggtctcgc 6300

acattcttca cgtccgttcg cagcgtcacc cggatcttcg ccgctaccct tgtgggcccc 6360 

ccggcgacgc ttcctgctcc gcccctaagt cgggaaggtt ccttgcggtt cgcggcgtgc 6420 

cggacgtgac aaacggaagc cgcacgtctc actagtaccc tcgcagacgg acagcgccag 6480 

ggagcaatgg cagcgcgccg accgcgatgg gctgtggcca atagcggctg ctcagcgggg 6540 

cgcgccgaga gcagcggccg ggaaggggcg gtgcgggagg cggggtgtgg ggcggtagtg 6600 

tgggccctgt tcctgcccgc gcggtgttcc gcattctgca agcctccgga gcgcacgtcg 6660 
gcagtcggct ccctcgttga ccgaatcacc gacctctctc cccaggctag cctcgagaat 6720

tcgccaccat ggctgagagc aaggaggcca gggatcaaga gatgaacctc aaggaagaga 6780 

gcaaagagga gaagcgccgc aacgactggt ggaagatcga cccacaaggc cccctggagg 6840 

gggaccagtg gtgccgcgtg ctgagacagt ccctgcccga ggagaagatt cctagccaga 6900 

cctgcatcgc cagaagacac ctcggccccg gtcccaccca gcacacaccc tccagaaggg 6960

ataggtggat taggggccag attttgcaag ccgaggtcct ccaagaaagg ctggaatgga 7020

gaattagggg cgtgcaacaa gccgctaaag agctgggaga ggtgaatcgc ggcatctgga 7080 

gggagctcta cttccgcgag gaccagaggg gcgatttctc cgcatgggga ggctaccaga 7140

gggcacaaga aaggctgtgg ggcgagcaga gcagcccccg cgtcttgagg cccggagact 7200 

ccaaaagacg ccgcaaacac ctgtgaagtc gacccgggcg gccgcttccc tttagtgagg 7260 

gttaatgctt cgagcagaca tgataagata cattgatgag tttggacaaa ccacaactag 7320 

aatgcagtga aaaaaatgct ttatttgtga aatttgtgat gctattgctt tatttgtaac 7380 

cattataagc tgcaataaac aagttaacaa caacaattgc attcatttta tgtttcaggt 7440 

tcagggggag atgtgggagg ttttttaaag caagtaaaac ctctacaaat gtggtaaaat 7500 

ccgataagga tcgatccggg ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc 7560 

caacagttgc gcagcctgaa tggcgaatgg acgcgccctg tagcggcgca ttaagcgcgg 7620 

cgggtgtggt ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc 7680 

ctttcgcttt cttcccttcc tttctcgcca cgttcgccgg ctttccccgt caagctctaa 7740 

atcgggggct ccctttaggg ttccgattta gagctttacg gcacctcgac cgcaaaaaac 7800 

ttgatttggg tgatggttca cgtaggccgc ctcggccgcc cgggcatcac tgcattaatg 7860 

aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg cttcctcgct 7920

cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc 7980 

ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt ataacttcgt 8040 

ataatgtatg ctatacgaag ttatacatgt gagcaaaagg ccagcaaaag gccaggaacc 8100 

gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca 8160 

aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt 8220

ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc 8280 

tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc 8340 

tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc 8400 

ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact 8460 

tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg 8520 

ctacagagtt cttgaagtgg tggcctaact acggctacac tagaaggaca gtatttggta 8580 

tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca 8640 

aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa 8700 

aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg 8760 

aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc 8820 

ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg 8880 

acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat 8940 

ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg 9000 

gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa 9060 

taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca 9120

tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc 9180 

gcaacgttgt tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt 9240 

cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa 9300

aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat 9360 

cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct 9420 

tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga 9480

gttgctcttg cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag 9540 

tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga 9600 

gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca 9660 

ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg 9720 

cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc 9780 

agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag 9840 

gggttccgcg cacatttccc cgaaaagtgc cacctaaatt gtaagcgtta atattttgtt 9900 

aaaattcgcg ttaaattttt gttaaatcag ctcatttttt aaccaatagg ccgaaatcgg 9960 

caaaatccct tataaatcaa aagaatagac cgagataggg ttgagtgttg ttccagtttg 10020 

gaacaagagt ccactattaa agaacgtgga ctccaacgtc aaagggcgaa aaaccgtcta 10080 

tcagggcgat ggcccactac gtgataactt cgtataatgt atgctatacg aagttatcac 10140 
tacgtgaacc atcaccctaa tcaagttttt tggggtcgag gtgccgtaaa gcactaaatc 10200

ggaaccctaa agggagcccc cgatttagag cttgacgggg aaagccaacc tggcttatcg 10260 

aaattaatac gactcactat agggagaccg gc 10292



<210> 49 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 49 

ctagcataac ttcgtataat gtatgctata cgaagttatt 40

<210> 50 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer
<400> 50 

ctagaataac ttcgtatagc atacattata cgaagttatg 40



<210> 51 
<211> 12206 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic
oligonucleotide



<400> 51 

agatctcccg atcccctatg gtcgactctc agtacaatct gctctgatgc cgcatagtta 60

agccagtatc tgctccctgc ttgtgtgttg gaggtcgctg agtagtgcgc gagcaaaatt 120

taagctacaa caaggcaagg cttgaccgac aattgcatga agaatctgct tagggttagg 180

cgttttgcgc tgcttcgcga tgtacgggcc agatatacgc gttgacattg attattgact 240

agttattaat agtaatcaat tacggggtca ttagttcata gcccatatat ggagttccgc 300

gttacataac ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg 360

acgtcaataa tgacgtatgt tcccatagta acgccaatag ggactttcca ttgacgtcaa 420

tgggtggact atttacggta aactgcccac ttggcagtac atcaagtgta tcatatgcca 480

agtacgcccc ctattgacgt caatgacggt aaatggcccg cctggcatta tgcccagtac 540

atgaccttat gggactttcc tacttggcag tacatctacg tattagtcat cgctattacc 600

atggtgatgc ggttttggca gtacaccaat gggcgtggat agcggtttga ctcacgggga 660

tttccaagtc tccaccccat tgacgtcaat gggagtttgt tttggcacca aaatcaacgg 720

gactttccaa aatgtcgtaa caactgcgat cgcccgcccc gttgacgcaa atgggcggta 780

ggcgtgtacg gtgggaggtc tatataagca gagctcgttt agtgaaccgt cagatcacta 840

gaagctttat tgcggtagtt tatcacagtt aaattgctaa cgcagtcagt gcttctgaca 900

caacagtctc gaacttaagc tgcagtgact ctcttaaggt agccttgcag aagttggtcg 960

tgaggcactg ggcaggtaag tatcaaggtt acaagacagg tttaaggaga ccaatagaaa 1020

ctgggcttgt cgagacagag aagactcttg cgtttctgat aggcacctat tggtcttact 1080

gacatccact ttgcctttct ctccacaggt gtccactccc agttcaatta cagctcttaa 1140

ggctagagta cttaatacga ctcactatag gctagcctcg aggtcgacgg tatcgcccga 1200

acagggacct gagaggggcg cagaccctac ctgttgaacc tggctgatcg taggatcccc 1260

gggacagcag aggagaactt acagaagtct tctggaggtg ttcctggcca gaacacagga 1320 

ggacaggtaa gatgggagac cctttgacat ggagcaaggc gctcaagaag ttagagaagg 1380

tgacggtaca agggtctcag aaattaacta ctggtaactg taattgggcg ctaagtctag 1440

tagacttatt tcatgatacc aactttgtaa aagaaaagga ctggcagctg agggatgtca 1500 

ttccattgct ggaagatgta actcagacgc tgtcaggaca agaaagagag gcctttgaaa 1560 

gaacatggtg ggcaatttct gctgtaaaga tgggcctcca gattaataat gtagtagatg 1620 

gaaaggcatc attccagctc ctaagagcga aatatgaaaa gaagactgct aataaaaagc 1680 

agtctgagcc ctctgaagaa tatccaatca tgatagatgg ggctggaaac agaaatttta 1740

gacctctaac acctagagga tatactactt gggtgaatac catacagaca aatggtctat 1800 

taaatgaagc tagtcaaaac ttatttggga tattatcagt agactgtact tctgaagaaa 1860 

tgaatgcatt tttggatgtg gtacctggcc aggcaggaca aaagcagata ttacttgatg 1920 

caattgataa gatagcagat gattgggata atagacatcc attaccgaat gctccactgg 1980 

tggcaccacc acaagggcct attcccatga cagcaaggtt tattagaggt ttaggagtac 2040

ctagagaaag acagatggag cctgcttttg atcagtttag gcagacatat agacaatgga 2100 

taatagaagc catgtcagaa ggcatcaaag tgatgattgg aaaacctaaa gctcaaaata 2160 

ttaggcaagg agctaaggaa ccttacccag aatttgtaga cagactatta tcccaaataa 2220 

aaagtgaggg acatccacaa gagatttcaa aattcttgac tgatacactg actattcaga 2280 

acgcaaatga ggaatgtaga aatgctatga gacatttaag accagaggat acattagaag 2340 

agaaaatgta tgcttgcaga gacattggaa ctacaaaaca aaagatgatg ttattggcaa 2400 

aagcacttca gactggtctt gcgggcccat ttaaaggtgg agccttgaaa ggagggccac 2460 

taaaggcagc acaaacatgt tataactgtg ggaagccagg acatttatct agtcaatgta 2520 

gagcacctaa agtctgtttt aaatgtaaac agcctggaca tttctcaaag caatgcagaa 2580 

gtgttccaaa aaacgggaag caaggggctc aagggaggcc ccagaaacaa actttcccga 2640

tacaacagaa gagtcagcac aacaaatctg ttgtacaaga gactcctcag actcaaaatc 2700 

tgtacccaga tctgagcgaa ataaaaaagg aatacaatgt caaggagaag gatcaagtag 2760 

aggatctcaa cctggacagt ttgtgggagt aacatataat ctagagaaaa ggcctactac 2820

aatagtatta attaatgata ctcccttaaa tgtactgtta gacacaggag cagatacttc 2880

agtgttgact actgcacatt ataataggtt aaaatataga gggagaaaat atcaagggac 2940

gggaataata ggagtgggag gaaatgtgga aacattttct acgcctgtga ctataaagaa 3000 

aaagggtaga cacattaaga caagaatgct agtggcagat attccagtga ctattttggg 3060 

acgagatatt cttcaggact taggtgcaaa attggttttg gcacagctct ccaaggaaat 3120

aaaatttaga aaaatagagt taaaagaggg cacaatgggg ccaaaaattc ctcaatggcc 3180 

actcactaag gagaaactag aaggggccaa agagatagtc caaagactat tgtcagaggg 3240 

aaaaatatca gaagctagtg acaataatcc ttataattca cccatatttg taataaaaaa 3300 

gaggtctggc aaatggaggt tattacaaga tctgagagaa ttaaacaaaa cagtacaagt 3360

aggaacggaa atatccagag gattgcctca cccgggagga ttaattaaat gtaaacacat 3420 

gactgtatta gatattggag atgcatattt cactataccc ttagatccag agtttagacc 3480 

atatacagct ttcactattc cctccattaa tcatcaagaa ccagataaaa gatatgtgtg 3540 

gaaatgttta ccacaaggat tcgtgttgag cccatatata tatcagaaaa cattacagga 3600 

aattttacaa ccttttaggg aaagatatcc tgaagtacaa ttgtatcaat atatggatga 3660 

tttgttcatg ggaagtaatg gttctaaaaa acaacacaaa gagttaatca tagaattaag 3720 

ggcgatctta ctggaaaagg gttttgagac accagatgat aaattacaag aagtgccacc 3780 

ttatagctgg ctaggttatc aactttgtcc tgaaaattgg aaagtacaaa aaatgcaatt 3840

agacatggta aagaatccaa cccttaatga tgtgcaaaaa ttaatgggga atataacatg 3900

gatgagctca gggatcccag ggttgacagt aaaacacatt gcagctacta ctaagggatg 3960 

tttagagttg aatcaaaaag taatttggac ggaagaggca caaaaagagt tagaagaaaa 4020 

taatgagaag attaaaaatg ctcaagggtt acaatattat aatccagaag aagaaatgtt 4080 

atgtgaggtt gaaattacaa aaaattatga ggcaacttat gttataaaac aatcacaagg 4140 

aatcctatgg gcaggtaaaa agattatgaa ggctaataag ggatggtcaa cagtaaaaaa 4200 

tttaatgtta ttgttgcaac atgtggcaac agaaagtatt actagagtag gaaaatgtcc 4260 

aacgtttaag gtaccattta ccaaagagca agtaatgtgg gaaatgcaaa aaggatggta 4320

ttattcttgg ctcccagaaa tagtatatac acatcaagta gttcatgatg attggagaat 4380 

gaaattggta gaagaaccta catcaggaat aacaatatac actgatgggg gaaaacaaaa 4440
tggagaagga atagcagctt atgtgaccag taatgggaga actaaacaga aaaggttagg 4500 
acctgtcact catcaagttg ctgaaagaat ggcaatacaa atggcattag aggataccag 4560 
agataaacaa gtaaatatag taactgatag ttattattgt tggaaaaata ttacagaagg 4620
attaggttta gaaggaccac aaagtccttg gtggcctata atacaaaata tacgagaaaa 4680 
agagatagtt tattttgctt gggtacctgg tcacaaaggg atatatggta atcaattggc 4740

agatgaagcc gcaaaaataa aagaagaaat catgctagca taccaaggca cacaaattaa 4800 

agagaaaaga gatgaagatg cagggtttga cttatgtgtt ccttatgaca tcatgatacc 4860 

tgtatctgac acaaaaatca tacccacaga tgtaaaaatt caagttcctc ctaatagctt 4920 

tggatgggtc actgggaaat catcaatggc aaaacagggg ttattaatta atggaggaat 4980

aattgatgaa ggatatacag gagaaataca agtgatatgt actaatattg gaaaaagtaa 5040 

tattaaatta atagagggac aaaaatttgc acaattaatt atactacagc atcactcaaa 5100 

ttccagacag ccttgggatg aaaataaaat atctcagaga ggggataaag gatttggaag 5160 

tacaggagta ttctgggtag aaaatattca ggaagcacaa gatgaacatg agaattggca 5220 

tacatcacca aagatattgg caagaaatta taagatacca ttgactgtag caaaacagat 5280 

aactcaagaa tgtcctcatt gcactaagca aggatcagga cctgcaggtt gtgtcatgag 5340 

atctcctaat cattggcagg cagattgcac acatttggac aataagataa tattgacttt 5400 

tgtagagtca aattcaggat acatacatgc tacattattg tcaaaagaaa atgcattatg 5460 

tacttcattg gctattttag aatgggcaag attgttttca ccaaagtcct tacacacaga 5520 

taacggcact aattttgtgg cagaaccagt tgtaaatttg ttgaagttcc taaagatagc 5580 

acataccaca ggaataccat atcatccaga aagtcagggt attgtagaaa gggcaaatag 5640 

gaccttgaaa gagaagattc aaagtcatag agacaacact caaacactgg aggcagcttt 5700 

acaacttgct ctcattactt gtaacaaagg gagggaaagt atgggaggac agacaccatg 5760 

ggaagtattt atcactaatc aagcacaagt aatacatgag aaacttttac tacagcaagc 5820 

acaatcctcc aaaaaatttt gtttttacaa aatccctggt gaacatgatt ggaagggacc 5880 

tactagggtg ctgtggaagg gtgatggtgc agtagtagtt aatgatgaag gaaagggaat 5940

aattgctgta ccattaacca ggactaagtt actaataaaa ccaaattgag tattgttgca 6000 

ggaagcaaga cccaactacc attgtcagct gtgtttcctg acctcaatat ttgttataag 6060 

gtttgatatg aatcccaggg ggaatctcaa cccctattac ccaacagtca gaaaaatcta 6120 

agtgtgagga gaacacaatg tttcaacctt attgttataa taatgacagt aagaacagca 6180 

tggcagaatc gaaggaagca agagaccaag aaatgaacct gaaagaagaa tctaaagaag 6240 

aaaaaagaag aaatgactgg tggaaaatag gtatgtttct gttatgctta gcaggaacta 6300 

ctggaggaat actttggtgg tatgaaggac tcccacagca acattatata gggttggtgg 6360

cgataggggg aagattaaac ggatctggcc aatcaaatgc tatagaatgc tggggttcct 6420

tcccggggtg tagaccattt caaaattact tcagttatga gaccaataga agcatgcata 6480 

tggataataa tactgctaca ttattagaag ctttaaccaa tataactgct ctataaataa 6540 

caaaacagaa ttagaaacat ggaagttagt aaagacttct ggcataactc ctttacctat 6600 

ttcttctgaa gctaacactg gactaattag acataagaga gattttggta taagtgcaat 6660 

agtggcagct attgtagccg ctactgctat tgctgctagc gctactatgt cttatgttgc 6720 

tctaactgag gttaacaaaa taatggaagt acaaaatcat acttttgagg tagaaaatag 6780 

tactctaaat ggtatggatt taatagaacg acaaataaag atattatatg ctatgattct 6840 

tcaaacacat gcagatgttc aactgttaaa ggaaagacaa caggtagagg agacatttaa 6900 

tttaattgga tgtatagaaa gaacacatgt attttgtcat actggtcatc cctggaatat 6960 

gtcatgggga catttaaatg agtcaacaca atgggatgac tgggtaagca aaatggaaga 7020

tttaaatcaa gagatactaa ctacacttca tggagccagg aacaatttgg cacaatccat 7080 

gataacattc aatacaccag atagtatagc tcaatttgga aaagaccttt ggagtcatat 7140

tggaaattgg attcctggat tgggagcttc cattataaaa tatatagtga tgtttttgct 7200 

tatttatttg ttactaacct cttcgcctaa gatcctcagg gccctctgga aggtgaccag 7260 

tggtgcaggg tcctccggca gtcgttacct gaagaaaaaa ttccatcaca aacatgcatc 7320 

gcgagaagac acctgggacc aggcccaaca caacatacac ctagcaggcg tgaccggtgg 7380 

atcaggggac aaatactaca agcagaagta ctccaggaac gactggaatg gagaatcaga 7440 

ggagtacaac aggcggccaa agagctgggt gaagtcaatc gaggcatttg gagagagcta 7500 

tatttccgag aagaccaaag gggagatttc tcagcctggg gcggctatca acgagcacaa 7560 

gaacggctct ggggggaaca atcctcacca agggtcctta gacctggaga ttcgaagcga 7620 

aggaggaaac atttatgact gttgcattaa agcccaagaa ggaactctcg ctatcccttg 7680 

ctgtggattt cccttatggc tattttgggg gtcgacccgg gcggccgcac tagaggaatt 7740 

cgcccctctc cctccccccc ccctaacgtt actggccgaa gccgcttgga ataaggccgg 7800 

tgtgtgtttg tctatatgtg attttccacc atattgccgt cttttggcaa tgtgagggcc 7860 

cggaaacctg gccctgtctt cttgacgagc attcctaggg gtctttcccc tctcgccaaa 7920 

ggaatgcaag gtctgttgaa tgtcgtgaag gaagcagttc ctctggaagc ttcttgaaga 7980 

caaacaacgt ctgtagcgac cctttgcagg cagcggaacc ccccacctgg cgacaggtgc 8040 

ctctgcggcc aaaagccacg tgtataagat acacctgcaa aggcggcaca accccagtgc 8100 

cacgttgtga gttggatagt tgtggaaaga gtcaaatggc tctcctcaag cgtagtcaac 8160 
aaggggctga aggatgccca gaaggtaccc cattgtatgg gaatctgatc tggggcctcg 8220
gtgcacatgc tttacatgtg tttagtcgag gttaaaaaag ctctaggccc cccgaaccac 8280
ggggacgtgg ttttcctttg aaaaacacga tgataagctt gccacaaccc cgtaccaaag 8340
atggatagat ccggaaagcc tgaactcacc gcgacgtctg tcgagaagtt tctgatcgaa 8400
aagttcgaca gcgtctccga cctgatgcag ctctcggagg gcgaagaatc tcgtgctttc 8460
agcttcgatg taggagggcg tggatatgtc ctgcgggtaa atagctgcgc cgatggtttc 8520
tacaaagatc gttatgttta tcggcacttt gcatcggccg cgctcccgat tccggaagtg 8580
cttgacattg gggaattcag cgagagcctg acctattgca tctcccgccg tgcacagggt 8640
gtcacgttgc aagacctgcc tgaaaccgaa ctgcccgctg ttctgcagcc ggtcgcggag 8700
gccatggatg cgatcgctgc ggccgatctt agccagacga gcgggttcgg cccattcgga 8760
ccgcaaggaa tcggtcaata cactacatgg cgtgatttca tatgcgcgat tgctgatccc 8820
catgtgtatc actggcaaac tgtgatggac gacaccgtca gtgcgtccgt cgcgcaggct 8880
ctcgatgagc tgatgctttg ggccgaggac tgccccgaag tccggcacct cgtgcacgcg 8940
gatttcggct ccaacaatgt cctgacggac aatggccgca taacagcggt cattgactgg 9000
agcgaggcga tgttcgggga ttcccaatac gaggtcgcca acatcttctt ctggaggccg 9060
tggttggctt gtatggagca gcagacgcgc tacttcgagc ggaggcatcc ggagcttgca 9120
ggatcgccgc ggctccgggc gtatatgctc cgcattggtc ttgaccaact ctatcagagc 9180
ttggttgacg gcaatttcga tgatgcagct tgggcgcagg gtcgatgcga cgcaatcgtc 9240
cgatccggag ccgggactgt cgggcgtaca caaatcgccc gcagaagcgc ggccgtctgg 9300
accgatggct gtgtagaagt actcgccgat agtggaaacc gacgccccag cactcgtccg 9360
agggcaaagg aatagagtag atgccgaccg aacaagagct gatttcgaga acgcctcagc 9420
cagcaactcg cgcgagccta gcaaggcaaa tgcgagagaa cggccttacg cttggtggca 9480
cagttctcgt ccacagttcg ctaagctcgc tcggctgggt cgcgggaggg ccggtcgcag 9540
tgattcaggc ccttctggat tgtgttggtc cccagggcac gattgtcatg cccacgcact 9600
cgggtgatct gactgatccc gcagattgga gatcgccgcc cgtgcctgcc gattgggtgc 9660
agatctagag ctcgctgatc agcctcgact gtgcctctag ttgccagcca tctgttgttt 9720
gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc ctttcctaat 9780
aaaatgagga aattgcatcg cattgtctga gtaggtgtca ttctattctg gggggtgggg 9840
tggggcagga cagcaagggg gaggattggg aagacaatag caggcatgct ggggatgcgg 9900
tgggctctat ggcttctgag gcggaaagaa ccagctgggg ctcgagtgca ttctagttgt 9960
ggtttgtcca aactcatcaa tgtatcttat catgtctgta taccgtcgac ctctagctag 10020 
agcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt 10080 
ccacacaaca tacgagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagc 10140
taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc 10200
cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct 10260
tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca 10320
gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac 10380
atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt 10440
ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg 10500
cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc 10560
tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc 10620
gtggcgcttt ctcaatgctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc 10680
aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac 10740
tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt 10800
aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct 10860
aactacggct acactagaag gacagtattt ggtatctgcg ctctgctgaa gccagttacc 10920
ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt 10980
ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg 11040
atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc 11100
atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa 11160
tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag 11220
gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg 11280
tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga 11340
gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag 11400
cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa 11460
gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc 11520
atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca 11580
aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg 11640
atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat 11700

aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc 11760 

aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg 11820

gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg 11880 

gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt 11940 

gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca 12000 

ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata 12060 

ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac 12120 

atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa 12180 
gtgccacctg acgtcgacgg atcggg 12206



<210> 52 
<211> 11592 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 52 

agatcttgaa taataaaatg tgtgtttgtc cgaaatacgc gttttgagat ttctgtcgcc 60 

gactaaattc atgtcgcgcg atagtggtgt ttatcgccga tagagatggc gatattggaa 120

aaattgatat ttgaaaatat ggcatattga aaatgtcgcc gatgtgagtt tctgtgtaac 180 

tgatatcgcc atttttccaa aagtgatttt tgggcatacg cgatatctgg cgatagcgct 240 

tatatcgttt acgggggatg gcgatagacg actttggtga cttgggcgat tctgtgtgtc 300 

gcaaatatcg cagtttcgat ataggtgaca gacgatatga ggctatatcg ccgatagagg 360 

cgacatcaag ctggcacatg gccaatgcat atcgatctat acattgaatc aatattggcc 420 

attagccata ttattcattg gttatatagc ataaatcaat attggctatt ggccattgca 480 

tacgttgtat ccatatcgta atatgtacat ttatattggc tcatgtccaa cattaccgcc 540 

atgttgacat tgattattga ctagttatta atagtaatca attacggggt cattagttca 600 

tagcccatat atggagttcc gcgttacata acttacggta aatggcccgc ctggctgacc 660 

gcccaacgac ccccgcccat tgacgtcaat aatgacgtat gttcccatag taacgccaat 720

agggactttc cattgacgtc aatgggtgga gtatttacgg taaactgccc acttggcagt 780 

acatcaagtg tatcatatgc caagtccgcc ccctattgac gtcaatgacg gtaaatggcc 840 

cgcctggcat tatgcccagt acatgacctt acgggacttt cctacttggc agtacatcta 900 

cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacacca atgggcgtgg 960 

atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca atgggagttt 1020 

gttttggcac caaaatcaac gggactttcc aaaatgtcgt aacaactgcg atcgcccgcc 1080 

ccgttgacgc aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt 1140 

ttagtgaacc gggcactcag attctgcggt ctgagtccct tctctgctgg gctgaaaagg 1200 

cctttgtaat aaatataatt ctctactcag tccctgtctc tagtttgtct gttcgagatc 1260 

ctacagttgg cgcccgaaca gggacctgag aggggcgcag accctacctg ttgaacctgg 1320 

ctgatcgtag gatccccggg acagcagagg agaacttaca gaagtcttct ggaggtgttc 1380 

ctggccagaa cacaggagga caggtaagat tgggagaccc tttgacattg gagcaaggcg 1440

ctcaagaagt tagagaaggt gacggtacaa gggtctcaga aattaactac tggtaactgt 1500 

aattgggcgc taagtctagt agacttattt cattgatacc aactttgtaa aagaaaagga 1560 

ctggcagctg agggattgtc attccattgc tggaagattg taactcagac gctgtcagga 1620

caagaaagag aggcctttga aagaacattg gtgggcaatt tctgctgtaa agattgggcc 1680 

tccagattaa taattgtagt agattggaaa ggcatcattc cagctcctaa gagcgaaata 1740 

ttgaaaagaa gactgctaat aaaaagcagt ctgagccctc tgaagaatat ctctagaact 1800 

agtggatccc ccgggctgca ggagtgggga ggcacgatgg ccgctttggt cgaggcggat 1860 

ccggccatta gccatattat tcattggtta tatagcataa atcaatattg gctattggcc 1920

attgcatacg ttgtatccat atcataatat gtacatttat attggctcat gtccaacatt 1980 

accgccatgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 2040 

agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg 2100 

ctgaccgccc aacgaccccc gcccattgac gtcaataatg acgtatgttc ccatagtaac 2160 
gccaataggg actttccatt gacgtcaatg ggtggagtat ttacggtaaa ctgcccactt 2220

ggcagtacat caagtgtatc atatgccaag tacgccccct attgacgtca atgacggtaa 2280 

atggcccgcc tggcattatg cccagtacat gaccttatgg gactttccta cttggcagta 2340 

catctacgta ttagtcatcg ctattaccat ggtgatgcgg ttttggcagt acatcaatgg 2400 

gcgtggatag cggtttgact cacggggatt tccaagtctc caccccattg acgtcaatgg 2460 

gagtttgttt tggcaccaaa atcaacggga ctttccaaaa tgtcgtaaca actccgcccc 2520 

attgacgcaa atgggcggta ggcatgtacg gtgggaggtc tatataagca gagctcgttt 2580 

agtgaaccgt cagatcgcct ggagacgcca tccacgctgt tttgacctcc atagaagaca 2640 

ccgggaccga tccagcctcc gcggccccaa gcttcagctg ctcgaggatc tgcggatccg 2700 

gggaattccc cagtctcagg atccaccatg ggggatcccg tcgttttaca acgtcgtgac 2760 

tgggaaaacc ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc 2820 

tggcgtaata gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat 2880 

ggcgaatggc gctttgcctg gtttccggca ccagaagcgg tgccggaaag ctggctggag 2940

tgcgatcttc ctgaggccga tactgtcgtc gtcccctcaa actggcagat gcacggttac 3000 

gatgcgccca tctacaccaa cgtaacctat cccattacgg tcaatccgcc gtttgttccc 3060 

acggagaatc cgacgggttg ttactcgctc acatttaatg ttgatgaaag ctggctacag 3120

gaaggccaga cgcgaattat ttttgatggc gttaactcgg cgtttcatct gtggtgcaac 3180 

gggcgctggg tcggttacgg ccaggacagt cgtttgccgt ctgaatttga cctgagcgca 3240 

tttttacgcg ccggagaaaa ccgcctcgcg gtgatggtgc tgcgttggag tgacggcagt 3300

tatctggaag atcaggatat gtggcggatg agcggcattt tccgtgacgt ctcgttgctg 3360 

cataaaccga ctacacaaat cagcgatttc catgttgcca ctcgctttaa tgatgatttc 3420 

agccgcgctg tactggaggc tgaagttcag atgtgcggcg agttgcgtga ctacctacgg 3480 

gtaacagttt ctttatggca gggtgaaacg caggtcgcca gcggcaccgc gcctttcggc 3540 

ggtgaaatta tcgatgagcg tggtggttat gccgatcgcg tcacactacg tctgaacgtc 3600 

gaaaacccga aactgtggag cgccgaaatc ccgaatctct atcgtgcggt ggttgaactg 3660 

cacaccgccg acggcacgct gattgaagca gaagcctgcg atgtcggttt ccgcgaggtg 3720 

cggattgaaa atggtctgct gctgctgaac ggcaagccgt tgctgattcg aggcgttaac 3780 

cgtcacgagc atcatcctct gcatggtcag gtcatggatg agcagacgat ggtgcaggat 3840

atcctgctga tgaagcagaa caactttaac gccgtgcgct gttcgcatta tccgaaccat 3900 

ccgctgtggt acacgctgtg cgaccgctac ggcctgtatg tggtggatga agccaatatt 3960

gaaacccacg gcatggtgcc aatgaatcgt ctgaccgatg atccgcgctg gctaccggcg 4020 

atgagcgaac gcgtaacgcg aatggtgcag cgcgatcgta atcacccgag tgtgatcatc 4080 

tggtcgctgg ggaatgaatc aggccacggc gctaatcacg acgcgctgta tcgctggatc 4140 

aaatctgtcg atccttcccg cccggtgcag tatgaaggcg gcggagccga caccacggcc 4200 

accgatatta tttgcccgat gtacgcgcgc gtggatgaag accagccctt cccggctgtg 4260 

ccgaaatggt ccatcaaaaa atggctttcg ctacctggag agacgcgccc gctgatcctt 4320 

tgcgaatacg cccacgcgat gggtaacagt cttggcggtt tcgctaaata ctggcaggcg 4380 

tttcgtcagt atccccgttt acagggcggc ttcgtctggg actgggtgga tcagtcgctg 4440 

attaaatatg atgaaaacgg caacccgtgg tcggcttacg gcggtgattt tggcgatacg 4500 

ccgaacgatc gccagttctg tatgaacggt ctggtctttg ccgaccgcac gccgcatcca 4560 

gcgctgacgg aagcaaaaca ccagcagcag tttttccagt tccgtttatc cgggcaaacc 4620 

atcgaagtga ccagcgaata cctgttccgt catagcgata acgagctcct gcactggatg 4680 

gtggcgctgg atggtaagcc gctggcaagc ggtgaagtgc ctctggatgt cgctccacaa 4740

ggtaaacagt tgattgaact gcctgaacta ccgcagccgg agagcgccgg gcaactctgg 4800 

ctcacagtac gcgtagtgca accgaacgcg accgcatggt cagaagccgg gcacatcagc 4860

gcctggcagc agtggcgtct ggcggaaaac ctcagtgtga cgctccccgc cgcgtcccac 4920

gccatcccgc atctgaccac cagcgaaatg gatttttgca tcgagctggg taataagcgt 4980 

tggcaattta accgccagtc aggctttctt tcacagatgt ggattggcga taaaaaacaa 5040 

ctgctgacgc cgctgcgcga tcagttcacc cgtgcaccgc tggataacga cattggcgta 5100 

agtgaagcga cccgcattga ccctaacgcc tgggtcgaac gctggaaggc ggcgggccat 5160 

taccaggccg aagcagcgtt gttgcagtgc acggcagata cacttgctga tgcggtgctg 5220 

attacgaccg ctcacgcgtg gcagcatcag gggaaaacct tatttatcag ccggaaaacc 5280 

taccggattg atggtagtgg tcaaatggcg attaccgttg atgttgaagt ggcgagcgat 5340

acaccgcatc cggcgcggat tggcctgaac tgccagctgg cgcaggtagc agagcgggta 5400 

aactggctcg gattagggcc gcaagaaaac tatcccgacc gccttactgc cgcctgtttt 5460 

gaccgctggg atctgccatt gtcagacatg tataccccgt acgtcttccc gagcgaaaac 5520 

ggtctgcgct gcgggacgcg cgaattgaat tatggcccac accagtggcg cggcgacttc 5580

cagttcaaca tcagccgcta cagtcaacag caactgatgg aaaccagcca tcgccatctg 5640 
ctgcacgcgg aagaaggcac atggctgaat atcgacggtt tccatatggg gattggtggc 5700

gacgactcct ggagcccgtc agtatcggcg gaattccagc tgagcgccgg tcgctaccat 5760 

taccagttgg tctggtgtca aaaataataa taaccgggca ggggggatcc gcagatccgg 5820 

ctgtggaatg tgtgtcagtt agggtgtgga aagtccccag gctccccagc aggcagaagt 5880 

atgcaaagca tgcctgcagg aattcgatat caagcttatc gataccgtcg acctcgaggg 5940

ggggcccggt acccagcttt tgttcccttt agtgagggtt aattgcgcgg gaagtattta 6000 

tcactaatca agcacaagta atacatgaga aacttttact acagcaagca caatcctcca 6060 

aaaaattttg tttttacaaa atccctggtg aacatgattg gaagggacct actagggtgc 6120 

tgtggaaggg tgatggtgca gtagtagtta atgatgaagg aaagggaata attgctgtac 6180 

cattaaccag gactaagtta ctaataaaac caaattgagt attgttgcag gaagcaagac 6240

ccaactacca ttgtcagctg tgtttcctga cctcaatatt tgttataagg tttgatatga 6300 

atcccagggg gaatctcaac ccctattacc caacagtcag aaaaatctaa gtgtgaggag 6360 

aacacaatgt ttcaacctta ttgttataat aatgacagta agaacagcat ggcagaatcg 6420

aaggaagcaa gagaccaaga atgaacctga aagaagaatc taaagaagaa aaaagaagaa 6480

atgactggtg gaaaataggt atgtttctgt tatgcttagc aggaactact ggaggaatac 6540

tttggtggta tgaaggactc ccacagcaac attatatagg gttggtggcg atagggggaa 6600 

gattaaacgg atctggccaa tcaaatgcta tagaatgctg gggttccttc ccggggtgta 6660 

gaccatttca aaattacttc agttatgaga ccaatagaag catgcatatg gataataata 6720 

ctgctacatt attagaagct ttaaccaata taactgctct ataaataaca aaacagaatt 6780 

agaaacatgg aagttagtaa agacttctgg cataactcct ttacctattt cttctgaagc 6840 

taacactgga ctaattagac ataagagaga ttttggtata agtgcaatag tggcagctat 6900 

tgtagccgct actgctattg ctgctagcgc tactatgtct tatgttgctc taactgaggt 6960 

taacaaaata atggaagtac aaaatcatac ttttgaggta gaaaatagta ctctaaatgg 7020 

tatggattta atagaacgac aaataaagat attatatgct atgattcttc aaacacatgc 7080 

agatgttcaa ctgttaaagg aaagacaaca ggtagaggag acatttaatt taattggatg 7140 

tatagaaaga acacatgtat tttgtcatac tggtcatccc tggaatatgt catggggaca 7200 

tttaaatgag tcaacacaat gggatgactg ggtaagcaaa atggaagatt taaatcaaga 7260

gatactaact acacttcatg gagccaggaa caatttggca caatccatga taacattcaa 7320 

tacaccagat agtatagctc aatttggaaa agacctttgg agtcatattg gaaattggat 7380

tcctggattg ggagcttcca ttataaaata tatagtgatg tttttgctta tttatttgtt 7440 

actaacctct tcgcctaaga tcctcagggc cctctggaag gtgaccagtg gtgcagggtc 7500 

ctccggcagt cgttacctga agaaaaaatt ccatcacaaa catgcatcgc gagaagacac 7560 

ctgggaccag gcccaacaca acatacacct agcaggcgtg accggtggat caggggacaa 7620 

atactacaag cagaagtact ccaggaacga ctggaatgga gaatcagagg agtacaacag 7680 

gcggccaaag agctgggtga agtcaatcga ggcatttgga gagagctata tttccgagaa 7740 

gaccaaaggg gagatttctc agcctggggc ggctatcaac gagcacaaga acggctctgg 7800 

ggggaacaat cctcaccaag ggtccttaga cctggagatt cgaagcgaag gaggaaacat 7860 

ttatgactgt tgcattaaag cccaagaagg aactctcgct atcccttgct gtggatttcc 7920 

cttatggcta ttttggggac tagtaattat agtaggacgc atagcaggct atggattacg 7980 

tggactcgct gttataataa ggatttgtat tagaggctta aatttgatat ttgaaataat 8040 

cagaaaaatg cttgattata ttggaagagc tttaaatcct ggcacatctc atgtatcaat 8100 

gcctcagtat gtttagaaaa acaagggggg aactgtgggg tttttatgag gggttttata 8160 

aactgcagga gtggggaggc acgatggccg ctttggtcga ggcggatccg gccattagcc 8220 

atattattca ttggttatat agcataaatc aatattggct attggccatt gcatacgttg 8280 

tatccatatc ataatatgta catttatatt ggctcatgtc caacattacc gccatgttga 8340 

cattgattat tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca 8400 

tatatggagt tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac 8460 

gacccccgcc cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact 8520 

ttccattgac gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa 8580 

gtgtatcata tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg 8640 

cattatgccc agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta 8700 

gtcatcgcta ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg 8760 

tttgactcac ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg 8820 

caccaaaatc aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg 8880 

ggcggtaggc atgtacggtg ggaggtctat ataagcagag ctcgtttagt gaaccgggca 8940 

ctcagattct gcggtctgag tcccttctct gctgggctga aaaggccttt gtaataaata 9000 
taattctcta ctcagtccct gtctctagtt tgtctgttcg agatcctaca gagctcatgc 9060 
cttggcgtaa tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc 9120 
acacaacata cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagtgagcta 9180

actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca 9240

gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc 9300

cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc 9360

tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat 9420

gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt 9480

ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg 9540

aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc 9600

tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt 9660

ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa 9720

gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta 9780

tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa 9840

caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa 9900

ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt 9960

cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt 10020

ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat 10080

cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat 10140

gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc 10200

aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc 10260

acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta 10320

gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga 10380

cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg 10440

cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc 10500

tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat 10560

cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag 10620

gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat 10680

cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa 10740

ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa 10800

gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga 10860

taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg 10920

gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc 10980

acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg 11040

aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact 11100

cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat 11160

atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt 11220

gccacctaaa ttgtaagcgt taatattttg ttaaaattcg cgttaaattt ttgttaaatc 11280

agctcatttt ttaaccaata ggccgaaatc ggcaaaatcc cttataaatc aaaagaatag 11340

accgagatag ggttgagtgt tgttccagtt tggaacaaga gtccactatt aaagaacgtg 11400

gactccaacg tcaaagggcg aaaaaccgtc tatcagggcg atggcccact acgtgaacca 11460

tcaccctaat caagtttttt ggggtcgagg tgccgtaaag cactaaatcg gaaccctaaa 11520

gggagccccc gatttagagc ttgacgggga aagccaacct ggcttatcga aattaatacg 11580

actcactata gg 11592



<210> 53 
<211> 10112 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 53 

tcaatattgg ccattagcca tattattcat tggttatata gcataaatca atattggcta 60 

ttggccattg catacgttgt atctatatca taatatgtac atttatattg gctcatgtcc 120 

aatatgaccg ccatgttggc attgattatt gactagttat taatagtaat caattacggg 180 

gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc 240 
gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat 300

agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc 360 

ccacttggca gtacatcaag tgtatcatat gccaagtccg ccccctattg acgtcaatga 420 

cggtaaatgg cccgcctggc attatgccca gtacatgacc ttacgggact ttcctacttg 480 

gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacac 540 

caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt 600 

caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactg 660 

cgatcgcccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata 720

agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac 780 

agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcagt 840 

gactctctta aggtagcctt gcagaagttg gtcgtgaggc actgggcagg taagtatcaa 900 

ggttacaaga caggtttaag gagaccaata gaaactgggc ttgtcgagac agagaagact 960 

cttgcgtttc tgataggcac ctattggtct tactgacatc cactttgcct ttctctccac 1020 

aggtgtccac tcccagttca attacagctc ttaaggctag agtacttaat acgactcact 1080 

ataggctaga gaattcgcca ccatgggcga tcccctcacc tggtccaaag ccctgaagaa 1140

actggaaaaa gtcaccgttc agggtagcca aaagcttacc acaggcaatt gcaactgggc 1200 

attgtccctg gtggatcttt tccacgacac taatttcgtt aaggagaaag attggcaact 1260 

cagagacgtg atccccctct tggaggacgt gacccaaaca ttgtctgggc aggagcgcga 1320 

agctttcgag cgcacctggt gggccatcag cgcagtcaaa atggggctgc aaatcaacaa 1380

cgtggttgac ggtaaagcta gctttcaact gctccgcgct aagtacgaga agaaaaccgc 1440

caacaagaaa caatccgaac ctagcgagga gtacccaatt atgatcgacg gcgccggcaa 1500 

taggaacttc cgcccactga ctcccagggg ctataccacc tgggtcaaca ccatccagac 1560 

aaacggactt ttgaacgaag cctcccagaa cctgttcggc atcctgtctg tggactgcac 1620 

ctccgaagaa atgaatgctt ttctcgacgt ggtgccagga caggctggac agaaacagat 1680 

cctgctcgat gccattgaca agatcgccga cgactgggat aatcgccacc ccctgccaaa 1740

cgcccctctg gtggctcccc cacaggggcc tatccctatg accgctaggt tcattagggg 1800 

actgggggtg ccccgcgaac gccagatgga gccagcattt gaccaattta ggcagaccta 1860 

cagacagtgg atcatcgaag ccatgagcga ggggattaaa gtcatgatcg gaaagcccaa 1920

ggcacagaac atcaggcagg gggccaagga accataccct gagtttgtcg acaggcttct 1980 

gtcccagatt aaatccgaag gccaccctca ggagatctcc aagttcttga cagacacact 2040 

gactatccaa aatgcaaatg aagagtgcag aaacgccatg aggcacctca gacctgaaga 2100 

taccctggag gagaaaatgt acgcatgtcg cgacattggc actaccaagc aaaagatgat 2160 

gctgctcgcc aaggctctgc aaaccggcct ggctggtcca ttcaaaggag gagcactgaa 2220 

gggaggtcca ttgaaagctg cacaaacatg ttataattgt gggaagccag gacatttatc 2280 

tagtcaatgt agagcaccta aagtctgttt taaatgtaaa cagcctggac atttctcaaa 2340 

gcaatgcaga agtgttccaa aaaacgggaa gcaaggggct caagggaggc cccagaaaca 2400 

aactttcccg atacaacaga agagtcagca caacaaatct gttgtacaag agactcctca 2460 

gactcaaaat ctgtacccag atctgagcga aataaaaaag gaatacaatg tcaaggagaa 2520 

ggatcaagta gaggatctca acctggacag tttgtgggag taacatacaa tctcgagaag 2580 

aggcccacta ccatcgtcct gatcaatgac acccctctta atgtgctgct ggacaccgga 2640 

gccgacacca gcgttctcac tactgctcac tataacagac tgaaatacag aggaaggaaa 2700 

taccagggca caggcatcat cggcgttgga ggcaacgtcg aaaccttttc cactcctgtc 2760 

accatcaaaa agaaggggag acacattaaa accagaatgc tggtcgccga catccccgtc 2820

accatccttg gcagagacat tctccaggac ctgggcgcta aactcgtgct ggcacaactg 2880 

tctaaggaaa tcaagttccg caagatcgag ctgaaagagg gcacaatggg tccaaaaatc 2940 

ccccagtggc ccctgaccaa agagaagctt gagggcgcta aggaaatcgt gcagcgcctg 3000 

ctttctgagg gcaagattag cgaggccagc gacaataacc cttacaacag ccccatcttt 3060 

gtgattaaga aaaggagcgg caaatggaga ctcctgcagg acctgaggga actcaacaag 3120 

accgtccagg tcggaactga gatctctcgc ggactgcctc accccggcgg cctgattaaa 3180 

tgcaagcaca tgacagtcct tgacattgga gacgcttatt ttaccatccc cctcgatcct 3240 

gaatttcgcc cctatactgc ttttaccatc cccagcatca atcaccagga gcccgataaa 3300

cgctatgtgt ggaagtgcct cccccaggga tttgtgctta gcccctacat ttaccagaag 3360 

acacttcaag agatcctcca acctttccgc gaaagatacc cagaggttca actctaccaa 3420 

tatatggacg acctgttcat ggggtccaac gggtctaaga agcagcacaa ggaactcatc 3480 

atcgaactga gggcaatcct cctggagaaa ggcttcgaga cacccgacga caagctgcaa 3540 

gaagttcctc catatagctg gctgggctac cagctttgcc ctgaaaactg gaaagtccag 3600 

aagatgcagt tggatatggt caagaaccca acactgaacg acgtccagaa gctcatgggc 3660 

aatattacct ggatgagctc cggaatccct gggcttaccg ttaagcacat tgccgcaact 3720 
acaaaaggat gcctggagtt gaaccagaag gtcatttgga cagaggaagc tcagaaggaa 3780

ctggaggaga ataatgaaaa gattaagaat gctcaagggc tccaatacta caatcccgaa 3840

gaagaaatgt tgtgcgaggt cgaaatcact aagaactacg aagccaccta tgtcatcaaa 3900 

cagtcccaag gcatcttgtg ggccggaaag aaaatcatga aggccaacaa aggctggtcc 3960

accgttaaaa atctgatgct cctgctccag cacgtcgcca ccgagtctat cacccgcgtc 4020 

ggcaagtgcc ccaccttcaa agttcccttc actaaggagc aggtgatgtg ggagatgcaa 4080 

aaaggctggt actactcttg gcttcccgag atcgtctaca cccaccaagt ggtgcacgac 4140 

gactggagaa tgaagcttgt cgaggagccc actagcggaa ttacaatcta taccgacggc 4200 

ggaaagcaaa acggagaggg aatcgctgca tacgtcacat ctaacggccg caccaagcaa 4260

aagaggctcg gccctgtcac tcaccaggtg gctgagagga tggctatcca gatggccctt 4320 

gaggacacta gagacaagca ggtgaacatt gtgactgaca gctactactg ctggaaaaac 4380 

atcacagagg gccttggcct ggagggaccc cagtctccct ggtggcctat catccagaat 4440

atccgcgaaa aggaaattgt ctatttcgcc tgggtgcctg gacacaaagg aatttacggc 4500 

aaccaactcg ccgatgaagc cgccaaaatt aaagaggaaa tcatgcttgc ctaccagggc 4560 

acacagatta aggagaagag agacgaggac gctggctttg acctgtgtgt gccatacgac 4620 

atcatgattc ccgttagcga cacaaagatc attccaaccg atgtcaagat ccaggtgcca 4680 

cccaattcat ttggttgggt gaccggaaag tccagcatgg ctaagcaggg tcttctgatt 4740 

aacgggggaa tcattgatga aggatacacc ggcgaaatcc aggtgatctg cacaaatatc 4800 

ggcaaaagca atattaagct tatcgaaggg cagaagttcg ctcaactcat catcctccag 4860 

caccacagca attcaagaca accttgggac gaaaacaaga ttagccagag aggtgacaag 4920

ggcttcggca gcacaggtgt gttctgggtg gagaacatcc aggaagcaca ggacgagcac 4980

gagaattggc acacctcccc taagattttg gcccgcaatt acaagatccc actgactgtg 5040 

gctaagcaga tcacacagga atgcccccac tgcaccaaac aaggttctgg ccccgccggc 5100 

tgcgtgatga ggtcccccaa tcactggcag gcagattgca cccacctcga caacaaaatt 5160 

atcctgacct tcgtggagag caattccggc tacatccacg caacactcct ctccaaggaa 5220 

aatgcattgt gcacctccct cgcaattctg gaatgggcca ggctgttctc tccaaaatcc 5280 

ctgcacaccg acaacggcac caactttgtg gctgaacctg tggtgaatct gctgaagttc 5340 

ctgaaaatcg cccacaccac tggcattccc tatcaccctg aaagccaggg cattgtcgag 5400 

agggccaaca gaactctgaa agaaaagatc caatctcaca gagacaatac acagacattg 5460 

gaggccgcac ttcagctcgc ccttatcacc tgcaacaaag gaagagaaag catgggcggc 5520 

cagaccccct gggaggtctt catcactaac caggcccagg tcatccatga aaagctgctc 5580 

ttgcagcagg cccagtcctc caaaaagttc tgcttttata agatccccgg tgagcacgac 5640 

tggaaaggtc ctacaagagt tttgtggaaa ggagacggcg cagttgtggt gaacgatgag 5700 

ggcaagggga tcatcgctgt gcccctgaca cgcaccaagc ttctcatcaa gccaaactga 5760 

acccggggcg gccgcttccc tttagtgagg gttaatgctt cgagcagaca tgataagata 5820 

cattgatgag tttggacaaa ccacaactag aatgcagtga aaaaaatgct ttatttgtga 5880 

aatttgtgat gctattgctt tatttgtaac cattataagc tgcaataaac aagttaacaa 5940 

caacaattgc attcatttta tgtttcaggt tcagggggag atgtgggagg ttttttaaag 6000 

caagtaaaac ctctacaaat gtggtaaaat ccgataagga tcgatccggg ctggcgtaat 6060 

agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 6120

acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg 6180 

ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca 6240 

cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta 6300 

gagctttacg gcacctcgac cgcaaaaaac ttgatttggg tgatggttca cgtagtgggc 6360 

catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg 6420

gactcttgtt ccaaactgga acaacactca accctatctc ggtctattct tttgatttat 6480 

aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa caaatattta 6540 

acgcgaattt taacaaaata ttaacgttta caatttcgcc tgatgcggta ttttctcctt 6600 

acgcatctgt gcggtatttc acaccgcata cgcggatctg cgcagcacca tggcctgaaa 6660 

taacctctga aagaggaact tggttaggta ccttctgagg cggaaagaac cagctgtgga 6720 

atgtgtgtca gttagggtgt ggaaagtccc caggctcccc agcaggcaga agtatgcaaa 6780

gcatgcatct caattagtca gcaaccaggt gtggaaagtc cccaggctcc ccagcaggca 6840

gaagtatgca aagcatgcat ctcaattagt cagcaaccat agtcccgccc ctaactccgc 6900 

ccatcccgcc cctaactccg cccagttccg cccattctcc gccccatggc tgactaattt 6960 

tttttattta tgcagaggcc gaggccgcct cggcctctga gctattccag aagtagtgag 7020 

gaggcttttt tggaggccta ggcttttgca aaaagcttga ttcttctgac acaacagtct 7080 

cgaacttaag gctagagcca ccatgattga acaagatgga ttgcacgcag gttctccggc 7140

cgcttgggtg gagaggctat tcggctatga ctgggcacaa cagacaatcg gctgctctga 7200 
tgccgccgtg ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca agaccgacct 7260

gtccggtgcc ctgaatgaac tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac 7320 

gggcgttcct tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg actggctgct 7380 

attgggcgaa gtgccggggc aggatctcct gtcatctcac cttgctcctg ccgagaaagt 7440

atccatcatg gctgatgcaa tgcggcggct gcatacgctt gatccggcta cctgcccatt 7500 

cgaccaccaa gcgaaacatc gcatcgagcg agcacgtact cggatggaag ccggtcttgt 7560 

cgatcaggat gatctggacg aagagcatca ggggctcgcg ccagccgaac tgttcgccag 7620

gctcaaggcg cgcatgcccg acggcgagga tctcgtcgtg acccatggcg atgcctgctt 7680 

gccgaatatc atggtggaaa atggccgctt ttctggattc atcgactgtg gccggctggg 7740

tgtggcggac cgctatcagg acatagcgtt ggctacccgt gatattgctg aagagcttgg 7800 

cggcgaatgg gctgaccgct tcctcgtgct ttacggtatc gccgctcccg attcgcagcg 7860 

catcgccttc tatcgccttc ttgacgagtt cttctgagcg ggactctggg gttcgaaatg 7920 

accgaccaag cgacgcccaa cctgccatca cgatggccgc aataaaatat ctttattttc 7980 

attacatctg tgtgttggtt ttttgtgtga atcgatagcg ataaggatcc gcgtatggtg 8040

cactctcagt acaatctgct ctgatgccgc atagttaagc cagccccgac acccgccaac 8100 

acccgctgac gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt 8160 

gaccgtctcc gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag 8220 

acgaaagggc ctcgtgatac gcctattttt ataggttaat gtcatgataa taatggtttc 8280 

ttagacgtca ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt 8340 

ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata 8400 

atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 8460 

tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 8520

tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 8580 

ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 8640

atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 8700 

ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 8760 

catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 8820

cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 8880 

ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 8940

cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 9000 

cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 9060 

tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 9120

agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 9180 

ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 9240

gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 9300

atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 9360 

cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 9420 

agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 9480 

ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 9540 

accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct 9600 

tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 9660 

cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 9720 

gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 9780 

gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 9840

gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 9900 

cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 9960 

tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 10020 

ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 10080 

ctggcctttt gctcacatgg ctcgacagat ct 10112 



<210> 54 
<211> 10114 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide
<400> 54 

tcaatattgg ccattagcca tattattcat tggttatata gcataaatca atattggcta 60 

ttggccattg catacgttgt atctatatca taatatgtac atttatattg gctcatgtcc 120 

aatatgaccg ccatgttggc attgattatt gactagttat taatagtaat caattacggg 180

gtcattagtt catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc 240

gcctggctga ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat 300

agtaacgcca atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc 360

ccacttggca gtacatcaag tgtatcatat gccaagtccg ccccctattg acgtcaatga 420

cggtaaatgg cccgcctggc attatgccca gtacatgacc ttacgggact ttcctacttg 480

gcagtacatc tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacac 540

caatgggcgt ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt 600

caatgggagt ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactg 660

cgatcgcccg ccccgttgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata 720 

agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac 780 

agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcagt 840

gactctctta aggtagcctt gcagaagttg gtcgtgaggc actgggcagg taagtatcaa 900

ggttacaaga caggtttaag gagaccaata gaaactgggc ttgtcgagac agagaagact 960

cttgcgtttc tgataggcac ctattggtct tactgacatc cactttgcct ttctctccac 1020

aggtgtccac tcccagttca attacagctc ttaaggctag agtacttaat acgactcact 1080

ataggctaga gaattccagg taagatgggc gatcccctca cctggtccaa agccctgaag 1140

aaactggaaa aagtcaccgt tcagggtagc caaaagctta ccacaggcaa ttgcaactgg 1200

gcattgtccc tggtggatct tttccacgac actaatttcg ttaaggagaa agattggcaa 1260

ctcagagacg tgatccccct cttggaggac gtgacccaaa cattgtctgg gcaggagcgc 1320

gaagctttcg agcgcacctg gtgggccatc agcgcagtca aaatggggct gcaaatcaac 1380

aacgtggttg acggtaaagc tagctttcaa ctgctccgcg ctaagtacga gaagaaaacc 1440

gccaacaaga aacaatccga acctagcgag gagtacccaa ttatgatcga cggcgccggc 1500

aataggaact tccgcccact gactcccagg ggctatacca cctgggtcaa caccatccag 1560 

acaaacggac ttttgaacga agcctcccag aacctgttcg gcatcctgtc tgtggactgc 1620

acctccgaag aaatgaatgc ttttctcgac gtggtgccag gacaggctgg acagaaacag 1680

atcctgctcg atgccattga caagatcgcc gacgactggg ataatcgcca ccccctgcca 1740

aacgcccctc tggtggctcc cccacagggg cctatcccta tgaccgctag gttcattagg 1800

ggactggggg tgccccgcga acgccagatg gagccagcat ttgaccaatt taggcagacc 1860

tacagacagt ggatcatcga agccatgagc gaggggatta aagtcatgat cggaaagccc 1920

aaggcacaga acatcaggca gggggccaag gaaccatacc ctgagtttgt cgacaggctt 1980

ctgtcccaga ttaaatccga aggccaccct caggagatct ccaagttctt gacagacaca 2040

ctgactatcc aaaatgcaaa tgaagagtgc agaaacgcca tgaggcacct cagacctgaa 2100

gataccctgg aggagaaaat gtacgcatgt cgcgacattg gcactaccaa gcaaaagatg 2160

atgctgctcg ccaaggctct gcaaaccggc ctggctggtc cattcaaagg aggagcactg 2220

aagggaggtc cattgaaagc tgcacaaaca tgttataatt gtgggaagcc aggacattta 2280

tctagtcaat gtagagcacc taaagtctgt tttaaatgta aacagcctgg acatttctca 2340

aagcaatgca gaagtgttcc aaaaaacggg aagcaagggg ctcaagggag gccccagaaa 2400

caaactttcc cgatacaaca gaagagtcag cacaacaaat ctgttgtaca agagactcct 2460

cagactcaaa atctgtaccc agatctgagc gaaataaaaa aggaatacaa tgtcaaggag 2520

aaggatcaag tagaggatct caacctggac agtttgtggg agtaacatac aatctcgaga 2580 

agaggcccac taccatcgtc ctgatcaatg acacccctct taatgtgctg ctggacaccg 2640 

gagccgacac cagcgttctc actactgctc actataacag actgaaatac agaggaagga 2700 

aataccaggg cacaggcatc atcggcgttg gaggcaacgt cgaaaccttt tccactcctg 2760 

tcaccatcaa aaagaagggg agacacatta aaaccagaat gctggtcgcc gacatccccg 2820 

tcaccatcct tggcagagac attctccagg acctgggcgc taaactcgtg ctggcacaac 2880 

tgtctaagga aatcaagttc cgcaagatcg agctgaaaga gggcacaatg ggtccaaaaa 2940

tcccccagtg gcccctgacc aaagagaagc ttgagggcgc taaggaaatc gtgcagcgcc 3000

tgctttctga gggcaagatt agcgaggcca gcgacaataa cccttacaac agccccatct 3060

ttgtgattaa gaaaaggagc ggcaaatgga gactcctgca ggacctgagg gaactcaaca 3120

agaccgtcca ggtcggaact gagatctctc gcggactgcc tcaccccggc ggcctgatta 3180

aatgcaagca catgacagtc cttgacattg gagacgctta ttttaccatc cccctcgatc 3240

ctgaatttcg cccctatact gcttttacca tccccagcat caatcaccag gagcccgata 3300
aacgctatgt gtggaagtgc ctcccccagg gatttgtgct tagcccctac atttaccaga 3360

agacacttca agagatcctc caacctttcc gcgaaagata cccagaggtt caactctacc 3420 

aatatatgga cgacctgttc atggggtcca acgggtctaa gaagcagcac aaggaactca 3480 

tcatcgaact gagggcaatc ctcctggaga aaggcttcga gacacccgac gacaagctgc 3540 

aagaagttcc tccatatagc tggctgggct accagctttg ccctgaaaac tggaaagtcc 3600 

agaagatgca gttggatatg gtcaagaacc caacactgaa cgacgtccag aagctcatgg 3660

gcaatattac ctggatgagc tccggaatcc ctgggcttac cgttaagcac attgccgcaa 3720 

ctacaaaagg atgcctggag ttgaaccaga aggtcatttg gacagaggaa gctcagaagg 3780 

aactggagga gaataatgaa aagattaaga atgctcaagg gctccaatac tacaatcccg 3840

aagaagaaat gttgtgcgag gtcgaaatca ctaagaacta cgaagccacc tatgtcatca 3900 

aacagtccca aggcatcttg tgggccggaa agaaaatcat gaaggccaac aaaggctggt 3960

ccaccgttaa aaatctgatg ctcctgctcc agcacgtcgc caccgagtct atcacccgcg 4020 

tcggcaagtg ccccaccttc aaagttccct tcactaagga gcaggtgatg tgggagatgc 4080

aaaaaggctg gtactactct tggcttcccg agatcgtcta cacccaccaa gtggtgcacg 4140 

acgactggag aatgaagctt gtcgaggagc ccactagcgg aattacaatc tataccgacg 4200

gcggaaagca aaacggagag ggaatcgctg catacgtcac atctaacggc cgcaccaagc 4260 

aaaagaggct cggccctgtc actcaccagg tggctgagag gatggctatc cagatggccc 4320

ttgaggacac tagagacaag caggtgaaca ttgtgactga cagctactac tgctggaaaa 4380

acatcacaga gggccttggc ctggagggac cccagtctcc ctggtggcct atcatccaga 4440 

atatccgcga aaaggaaatt gtctatttcg cctgggtgcc tggacacaaa ggaatttacg 4500 

gcaaccaact cgccgatgaa gccgccaaaa ttaaagagga aatcatgctt gcctaccagg 4560 

gcacacagat taaggagaag agagacgagg acgctggctt tgacctgtgt gtgccatacg 4620 

acatcatgat tcccgttagc gacacaaaga tcattccaac cgatgtcaag atccaggtgc 4680 

cacccaattc atttggttgg gtgaccggaa agtccagcat ggctaagcag ggtcttctga 4740 

ttaacggggg aatcattgat gaaggataca ccggcgaaat ccaggtgatc tgcacaaata 4800 

tcggcaaaag caatattaag cttatcgaag ggcagaagtt cgctcaactc atcatcctcc 4860 

agcaccacag caattcaaga caaccttggg acgaaaacaa gattagccag agaggtgaca 4920

agggcttcgg cagcacaggt gtgttctggg tggagaacat ccaggaagca caggacgagc 4980 

acgagaattg gcacacctcc cctaagattt tggcccgcaa ttacaagatc ccactgactg 5040 

tggctaagca gatcacacag gaatgccccc actgcaccaa acaaggttct ggccccgccg 5100 

gctgcgtgat gaggtccccc aatcactggc aggcagattg cacccacctc gacaacaaaa 5160 

ttatcctgac cttcgtggag agcaattccg gctacatcca cgcaacactc ctctccaagg 5220 

aaaatgcatt gtgcacctcc ctcgcaattc tggaatgggc caggctgttc tctccaaaat 5280 

ccctgcacac cgacaacggc accaactttg tggctgaacc tgtggtgaat ctgctgaagt 5340

tcctgaaaat cgcccacacc actggcattc cctatcaccc tgaaagccag ggcattgtcg 5400 

agagggccaa cagaactctg aaagaaaaga tccaatctca cagagacaat acacagacat 5460 

tggaggccgc acttcagctc gcccttatca cctgcaacaa aggaagagaa agcatgggcg 5520

gccagacccc ctgggaggtc ttcatcacta accaggccca ggtcatccat gaaaagctgc 5580 

tcttgcagca ggcccagtcc tccaaaaagt tctgctttta taagatcccc ggtgagcacg 5640 

actggaaagg tcctacaaga gttttgtgga aaggagacgg cgcagttgtg gtgaacgatg 5700

agggcaaggg gatcatcgct gtgcccctga cacgcaccaa gcttctcatc aagccaaact 5760

gaacccgggg cggccgcttc cctttagtga gggttaatgc ttcgagcaga catgataaga 5820 

tacattgatg agtttggaca aaccacaact agaatgcagt gaaaaaaatg ctttatttgt 5880 

gaaatttgtg atgctattgc tttatttgta accattataa gctgcaataa acaagttaac 5940 

aacaacaatt gcattcattt tatgtttcag gttcaggggg agatgtggga ggttttttaa 6000 

agcaagtaaa acctctacaa atgtggtaaa atccgataag gatcgatccg ggctggcgta 6060 

atagcgaaga ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat 6120

ggacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac 6180 

cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt cctttctcgc 6240 

cacgttcgcc ggctttcccc gtcaagctct aaatcggggg ctccctttag ggttccgatt 6300 

tagagcttta cggcacctcg accgcaaaaa acttgatttg ggtgatggtt cacgtagtgg 6360

gccatcgccc tgatagacgg tttttcgccc tttgacgttg gagtccacgt tctttaatag 6420 

tggactcttg ttccaaactg gaacaacact caaccctatc tcggtctatt cttttgattt 6480 

ataagggatt ttgccgattt cggcctattg gttaaaaaat gagctgattt aacaaatatt 6540 

taacgcgaat tttaacaaaa tattaacgtt tacaatttcg cctgatgcgg tattttctcc 6600 

ttacgcatct gtgcggtatt tcacaccgca tacgcggatc tgcgcagcac catggcctga 6660 

aataacctct gaaagaggaa cttggttagg taccttctga ggcggaaaga accagctgtg 6720

gaatgtgtgt cagttagggt gtggaaagtc cccaggctcc ccagcaggca gaagtatgca 6780 
aagcatgcat ctcaattagt cagcaaccag gtgtggaaag tccccaggct ccccagcagg 6840

cagaagtatg caaagcatgc atctcaatta gtcagcaacc atagtcccgc ccctaactcc 6900 

gcccatcccg cccctaactc cgcccagttc cgcccattct ccgccccatg gctgactaat 6960 

tttttttatt tatgcagagg ccgaggccgc ctcggcctct gagctattcc agaagtagtg 7020 

aggaggcttt tttggaggcc taggcttttg caaaaagctt gattcttctg acacaacagt 7080 

ctcgaactta aggctagagc caccatgatt gaacaagatg gattgcacgc aggttctccg 7140 

gccgcttggg tggagaggct attcggctat gactgggcac aacagacaat cggctgctct 7200

gatgccgccg tgttccggct gtcagcgcag gggcgcccgg ttctttttgt caagaccgac 7260 

ctgtccggtg ccctgaatga actgcaggac gaggcagcgc ggctatcgtg gctggccacg 7320 

acgggcgttc cttgcgcagc tgtgctcgac gttgtcactg aagcgggaag ggactggctg 7380 

ctattgggcg aagtgccggg gcaggatctc ctgtcatctc accttgctcc tgccgagaaa 7440

gtatccatca tggctgatgc aatgcggcgg ctgcatacgc ttgatccggc tacctgccca 7500 

ttcgaccacc aagcgaaaca tcgcatcgag cgagcacgta ctcggatgga agccggtctt 7560 

gtcgatcagg atgatctgga cgaagagcat caggggctcg cgccagccga actgttcgcc 7620

aggctcaagg cgcgcatgcc cgacggcgag gatctcgtcg tgacccatgg cgatgcctgc 7680 

ttgccgaata tcatggtgga aaatggccgc ttttctggat tcatcgactg tggccggctg 7740 

ggtgtggcgg accgctatca ggacatagcg ttggctaccc gtgatattgc tgaagagctt 7800 

ggcggcgaat gggctgaccg cttcctcgtg ctttacggta tcgccgctcc cgattcgcag 7860 

cgcatcgcct tctatcgcct tcttgacgag ttcttctgag cgggactctg gggttcgaaa 7920 

tgaccgacca agcgacgccc aacctgccat cacgatggcc gcaataaaat atctttattt 7980 

tcattacatc tgtgtgttgg ttttttgtgt gaatcgatag cgataaggat ccgcgtatgg 8040

tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagccccg acacccgcca 8100 

acacccgctg acgcgccctg acgggcttgt ctgctcccgg catccgctta cagacaagct 8160 

gtgaccgtct ccgggagctg catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg 8220

agacgaaagg gcctcgtgat acgcctattt ttataggtta atgtcatgat aataatggtt 8280 

tcttagacgt caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt 8340

ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa 8400 

taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt 8460 

tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat 8520 

gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag 8580

atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg 8640 

ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata 8700 

cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat 8760 

ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc 8820 

aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg 8880 

ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac 8940

gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact 9000 

ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa 9060 

gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct 9120 

ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc 9180 

tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga 9240 

cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac 9300 

tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag 9360 

atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg 9420 

tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc 9480 

tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag 9540

ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc 9600 

cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac 9660 

ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc 9720 

gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt 9780

tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt 9840 

gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc 9900 

ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt 9960 

tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca 10020 

ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt 10080 

tgctggcctt ttgctcacat ggctcgacag atct 10114 
<210> 55
<211> 12473 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 55 

gttaccttct gctctgcaga atggccaacc tttaacgtcg gatggccgcg agacggcacc 60 

tttaaccgag acctcatcac ccaggttaag atcaaggtct tttcacctgg cccgcatgga 120 

cacccagacc aggtccccta catcgtgacc tgggaagcct tggcttttga cccccctccc 180 

tgggtcaagc cctttgtaca ccctaagcct ccgcctcctc ttcctccatc cgccccgtct 240 

ctcccccttg aacctcctcg ttcgaccccg cctcgatcct ccctttatcc agccctcact 300 

ccttctctag gcgccggaat tcgttaactc gagaggcctg ccaccatggg gactgctcca 360 

aagaagaagc gtaaggtagt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc 420 

caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag cgaagaggcc 480

cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg ctttgcctgg 540 

tttccggcac cagaagcggt gccggaaagc tggctggagt gcgatcttcc tgaggccgat 600 

actgtcgtcg tcccctcaaa ctggcagatg cacggttacg atgcgcccat ctacaccaac 660 

gtaacctatc ccattacggt caatccgccg tttgttccca cggagaatcc gacgggttgt 720

tactcgctca catttaatgt tgatgaaagc tggctacagg aaggccagac gcgaattatt 780 

tttgatggcg ttaactcggc gtttcatctg tggtgcaacg ggcgctgggt cggttacggc 840

caggacagtc gtttgccgtc tgaatttgac ctgagcgcat ttttacgcgc cggagaaaac 900 

cgcctcgcgg tgatggtgct gcgttggagt gacggcagtt atctggaaga tcaggatatg 960

tggcggatga gcggcatttt ccgtgacgtc tcgttgctgc ataaaccgac tacacaaatc 1020 

agcgatttcc atgttgccac tcgctttaat gatgatttca gccgcgctgt actggaggct 1080 

gaagttcaga tgtgcggcga gttgcgtgac tacctacggg taacagtttc tttatggcag 1140 

ggtgaaacgc aggtcgccag cggcaccgcg cctttcggcg gtgaaattat cgatgagcgt 1200 

ggtggttatg ccgatcgcgt cacactacgt ctgaacgtcg aaaacccgaa actgtggagc 1260

gccgaaatcc cgaatctcta tcgtgcggtg gttgaactgc acaccgccga cggcacgctg 1320 

attgaagcag aagcctgcga tgtcggtttc cgcgaggtgc ggattgaaaa tggtctgctg 1380 

ctgctgaacg gcaagccgtt gctgattcga ggcgttaacc gtcacgagca tcatcctctg 1440 

catggtcagg tcatggatga gcagacgatg gtgcaggata tcctgctgat gaagcagaac 1500 

aactttaacg ccgtgcgctg ttcgcattat ccgaaccatc cgctgtggta cacgctgtgc 1560 

gaccgctacg gcctgtatgt ggtggatgaa gccaatattg aaacccacgg catggtgcca 1620 

atgaatcgtc tgaccgatga tccgcgctgg ctaccggcga tgagcgaacg cgtaacgcga 1680 

atggtgcagc gcgatcgtaa tcacccgagt gtgatcatct ggtcgctggg gaatgaatca 1740

ggccacggcg ctaatcacga cgcgctgtat cgctggatca aatctgtcga tccttcccgc 1800 

ccggtgcagt atgaaggcgg cggagccgac accacggcca ccgatattat ttgcccgatg 1860 

tacgcgcgcg tggatgaaga ccagcccttc ccggctgtgc cgaaatggtc catcaaaaaa 1920

tggctttcgc tacctggaga gacgcgcccg ctgatccttt gcgaatacgc ccacgcgatg 1980 

ggtaacagtc ttggcggttt cgctaaatac tggcaggcgt ttcgtcagta tccccgttta 2040 

cagggcggct tcgtctggga ctgggtggat cagtcgctga ttaaatatga tgaaaacggc 2100 

aacccgtggt cggcttacgg cggtgatttt ggcgatacgc cgaacgatcg ccagttctgt 2160 

atgaacggtc tggtctttgc cgaccgcacg ccgcatccag cgctgacgga agcaaaacac 2220 

cagcagcagt ttttccagtt ccgtttatcc gggcaaacca tcgaagtgac cagcgaatac 2280 

ctgttccgtc atagcgataa cgagctcctg cactggatgg tggcgctgga tggtaagccg 2340 

ctggcaagcg gtgaagtgcc tctggatgtc gctccacaag gtaaacagtt gattgaactg 2400 

cctgaactac cgcagccgga gagcgccggg caactctggc tcacagtacg cgtagtgcaa 2460 

ccgaacgcga ccgcatggtc agaagccggg cacatcagcg cctggcagca gtggcgtctg 2520 

gcggaaaacc tcagtgtgac gctccccgcc gcgtcccacg ccatcccgca tctgaccacc 2580 

agcgaaatgg atttttgcat cgagctgggt aataagcgtt ggcaatttaa ccgccagtca 2640 

ggctttcttt cacagatgtg gattggcgat aaaaaacaac tgctgacgcc gctgcgcgat 2700 

cagttcaccc gtgcaccgct ggataacgac attggcgtaa gtgaagcgac ccgcattgac 2760

cctaacgcct gggtcgaacg ctggaaggcg gcgggccatt accaggccga agcagcgttg 2820

ttgcagtgca cggcagatac acttgctgat gcggtgctga ttacgaccgc tcacgcgtgg 2880 
cagcatcagg ggaaaacctt atttatcagc cggaaaacct accggattga tggtagtggt 2940

caaatggcga ttaccgttga tgttgaagtg gcgagcgata caccgcatcc ggcgcggatt 3000 

ggcctgaact gccagctggc gcaggtagca gagcgggtaa actggctcgg attagggccg 3060 

caagaaaact atcccgaccg ccttactgcc gcctgttttg accgctggga tctgccattg 3120 

tcagacatgt ataccccgta cgtcttcccg agcgaaaacg gtctgcgctg cgggacgcgc 3180 

gaattgaatt atggcccaca ccagtggcgc ggcgacttcc agttcaacat cagccgctac 3240 

agtcaacagc aactgatgga aaccagccat cgccatctgc tgcacgcgga agaaggcaca 3300 

tggctgaata tcgacggttt ccatatgggg attggtggcg acgactcctg gagcccgtca 3360 

gtatcggcgg aattccagct gagcgccggt cgctaccatt accagttggt ctggtgtcaa 3420

aaataataat aaccgggcag gggggatccg cagatccggc tgtggaatgt gtgtcagtta 3480 

gggtgtggaa agtccccagg ctccccagca ggcagaagta tgcaaagcat gcctgcagga 3540 

gtggggaggc acgatggccg ctttggtcga ggcggatccg gccattagcc atattattca 3600 

ttggttatat agcataaatc aatattggct attggccatt gcatacgttg tatccatatc 3660 

ataatatgta catttatatt ggctcatgtc caacattacc gccatgttga cattgattat 3720 

tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 3780 

tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 3840 

cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 3900 

gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 3960

tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 4020 

agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 4080 

ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 4140 

ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 4200 

aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 4260 

atgtacggtg ggaggtctat ataagcagag ctcgtttagt gaaccgtcag atcgcctgga 4320 

gacgccatcc acgctgtttt gacctccata gaagacaccg ggaccgatcc agcctccgcg 4380 

gccccaagct tgttgggatc caccggtcgc caccatggtg agcaagggcg aggagctgtt 4440 

caccggggtg gtgcccatcc tggtcgagct ggacggcgac gtaaacggcc acaagttcag 4500 

cgtgtccggc gagggcgagg gcgatgccac ctacggcaag ctgaccctga agttcatctg 4560 

caccaccggc aagctgcccg tgccctggcc caccctcgtg accaccctga cctacggcgt 4620 

gcagtgcttc agccgctacc ccgaccacat gaagcagcac gacttcttca agtccgccat 4680 

gcccgaaggc tacgtccagg agcgcaccat cttcttcaag gacgacggca actacaagac 4740 

ccgcgccgag gtgaagttcg agggcgacac cctggtgaac cgcatcgagc tgaagggcat 4800 

cgacttcaag gaggacggca acatcctggg gcacaagctg gagtacaact acaacagcca 4860 

caacgtctat atcatggccg acaagcagaa gaacggcatc aaggtgaact tcaagatccg 4920

ccacaacatc gaggacggca gcgtgcagct cgccgaccac taccagcaga acacccccat 4980 

cggcgacggc cccgtgctgc tgcccgacaa ccactacctg agcacccagt ccgccctgag 5040

caaagacccc aacgagaagc gcgatcacat ggtcctgctg gagttcgtga ccgccgccgg 5100 

gatcactctc ggcatggacg agctgtacaa gtaaagcggc cgcgactcta gatcataatc 5160 

agccatacca catttgtaga ggttttactt gctttaaaaa acctcccaca cctccccctg 5220 

aacctgaaac ataaaatgaa tgcaattgtt gttgttaaca tcgataaaat aaaagatttt 5280 

atttagtctc cagaaaaagg ggggaatgaa agaccccacc tgtaggtttg gcaagctagc 5340 

ttaagtaacg ccattttgca aggcatggaa aaatacataa ctgagaatag agaagttcag 5400 

atcaaggtca ggaacagatg gaacagctga atatgggcca aacaggatat ctgtggtaag 5460 

cagttcctgc cccggctcag ggccaagaac agatggaaca gctgaatatg ggccaaacag 5520 

gatatctgtg gtaagcagtt cctgccccgg ctcagggcca agaacagatg gtccccagat 5580 

gcggtccagc cctcagcagt ttctagagaa ccatcagatg tttccagggt gccccaagga 5640 

cctgaaatga ccctgtgcct tatttgaact aaccaatcag ttcgcttctc gcttctgttc 5700 

gcgcgcttct gctccccgag ctcaataaaa gagcccacaa cccctcactc ggggcgccag 5760 

tcctccgatt gactgagtcg cccgggtacc cgtgtatcca ataaaccctc ttgcagttgc 5820 

atccgacttg tggtctcgct gttccttggg agggtctcct ctgagtgatt gactacccgt 5880 

cagcgggggt ctttcatttg ggggctcgtc cgggatcggg agacccctgc ccagggacca 5940 

ccgacccacc accgggaggt aagctggcca gcaacttatc tgtgtctgtc cgattgtcta 6000 

gtgtctatga ctgattttat gcgcctgcgt cggtactagt tagctaacta gctctgtatc 6060 

tggcggaccc gtggtggaac tgacgagttc ggaacacccg gccgcaaccc tgggagagga 6120

attctcatgt ttgacagctt atcatcgata agctttttgc aaaagcctag gcctccaaaa 6180 

aagcctcctc actacttctg gaatagctca gaggccgagg cggcctcggc ctctgcataa 6240 

ataaaaaaaa ttagtcagcc atggggcgga gaatgggcgg aactgggcgg agttaggggc 6300 

gggatgggcg gagttagggg cgggactatg gttgctgact aattgagatg catgctttgc 6360 
atacttctgc ctgctgggga gcctggggac tttccacacc tggttgctga ctaattgaga 6420

tgcatgcttt gcatacttct gcctgctggg gagcctgggg actttccaca ccctaactga 6480

cacacattcc acagccggat cctctacgcc ggacgcatcg tggccggcat caccggcgcc 6540 

acaggtgcgg ttgctggcgc ctatatcgcc gacatcaccg atggggaaga tcgggctcgc 6600 

cacttcgggc tcatgagcgc ttgtttcggc gtgggtatgg tggcaggccc cgtggccggg 6660 

ggactgttgg gcgccatctc cttgcatgca ccattccttg cggcggcggt gctcaacggc 6720 

ctcaacctac tactgggctg cttcctaatg caggagtcgc ataagggaga gcgtcgaccg 6780 

atgcccttga gagccttcaa cccagtcagc tccttccggt gggcgcgggg catgactatc 6840 

gtcgccgcac ttatgactgt cttctttatc atgcaactcg taggacaggt gccggcagcg 6900 

ctctgggtca ttttcggcga ggaccgcttt cgctggagcg cgacgatgat cggcctgtcg 6960 

cttgcggtat tcggaatctt gcacgccctc gctcaagcct tcgtcactgg tcccgccacc 7020 

aaacgtttcg gcgagaagca ggccattatc gccggcatgg cggccgacgc gctgggctac 7080 

gtcttgctgg cgttcgcgac gcgaggctgg atggccttcc ccattatgat tcttctcgct 7140 

tccggcggca tcgggatgcc cgcgttgcag gccatgctgt ccaggcaggt agatgacgac 7200 

catcagggac agcttcaagg atcgctcgcg gctcttacca gcctaacttc gatcactgga 7260 

ccgctgatcg tcacggcgat ttatgccgcc tcggcgagca catggaacgg gttggcatgg 7320 

attgtaggcg ccgccctata ccttgtctgc ctccccgcgt tgcgtcgcgg tgcatggagc 7380

cgggccacct cgacctgaat ggaagccggc ggcacctcgc taacggattc accactccaa 7440 

gaattggagc caatcaattc ttgcggagaa ctgtgaatgc gcaaaccaac ccttggcaga 7500 

acatatccat cgcgtccgcc atctccagca gccgcacgcg gcgcatctcg ggcagcgttg 7560 

ggtcctggcc acgggtgcgc atgatcgtgc tcctgtcgtt gaggacccgg ctaggctggc 7620 

ggggttgcct tactggttag cagaatgaat caccgatacg cgagcgaacg tgaagcgact 7680 

gctgctgcaa aacgtctgcg acctgagcaa caacatgaat ggtcttcggt ttccgtgttt 7740 

cgtaaagtct ggaaacgcgg aagtcagcgc cctgcaccat tatgttccgg atctgcatcg 7800 

caggatgctg ctggctaccc tgtggaacac ctacatctgt attaacgaag cgctggcatt 7860 

gaccctgagt gatttttctc tggtcccgcc gcatccatac cgccagttgt ttaccctcac 7920 

aacgttccag taaccgggca tgttcatcat cagtaacccg tatcgtgagc atcctctctc 7980 

gtttcatcgg tatcattacc cccatgaaca gaaattcccc cttacacgga ggcatcaagt 8040 

gaccaaacag gaaaaaaccg cccttaacat ggcccgcttt atcagaagcc agacattaac 8100 

gcttctggag aaactcaacg agctggacgc ggatgaacag gcagacatct gtgaatcgct 8160 

tcacgaccac gctgatgagc tttaccgcag ctgcctcgcg cgtttcggtg atgacggtga 8220 

aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg 8280

gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gcgcagccat 8340

gacccagtca cgtagcgata gcggagtgta tactggctta actatgcggc atcagagcag 8400 

attgtactga gagtgcacca tatgcggtgt gaaataccgc acagatgcgt aaggagaaaa 8460 

taccgcatca ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 8520 

ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg 8580 

gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag 8640 

gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 8700 

cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 8760 

ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 8820 

tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg 8880 

gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 8940 

tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 9000 

ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 9060 

ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct 9120

ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 9180 

accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 9240

tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 9300 

cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 9360 

taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 9420

caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 9480 

gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt 9540 

gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag 9600 

ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct 9660 

attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt 9720

gttgccattg ctgcaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc 9780 

tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt 9840 






agctccttcg gtcctccgat cgtcgttgtc agaagtaagt tggccgcagt gttatcactc 9900 

atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct 9960 

gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc 10020

tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt aaaagtgctc 10080 

atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc 10140 

agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc 10200 

gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca 10260 

cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt 10320 

tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt 10380 

ccgcgcacat ttccccgaaa agtgccacct gacgtctaag aaaccattat tatcatgaca 10440 

ttaacctata aaaataggcg tatcacgagg ccctttcgtc tcgcgcgttt cggtgatgac 10500 

ggtgaaaacc tctgacacat gcagctcccg gagacggtca cagcttgtct gtaagcggat 10560 

gccgggagca gacaagcccg tcagggcgcg tcagcgggtg ttggcgggtg tcggggctgg 10620

cttaactatg cggcatcaga gcagattgta ctgagagtgc accatatcga cgctctccct 10680 

tatgcgactc ctgcattagg aagcagccca gtagtaggtt gaggccgttg agcaccgccg 10740 

ccgcaaggaa tggtgcatgc aaggagatgg cgcccaacag tcccccggcc acggggcctg 10800 

ccaccatacc cacgccgaaa caagcgctca tgagcccgaa gtggcgagcc cgatcttccc 10860 

catcggtgat gtcggcgata taggcgccag caaccgcacc tgtggcgccg gtgatgccgg 10920

ccacgatgcg tccggcgtag aggatctggc tagcgatgac cctgctgatt ggttcgctga 10980 

ccatttccgg ggtgcggaac ggcgttacca gaaactcaga aggttcgtcc aaccaaaccg 11040 

actctgacgg cagtttacga gagagatgat agggtctgct tcagtaagcc agatgctaca 11100 

caattaggct tgtacatatt gtcgttagaa cgcggctaca attaatacat aaccttatgt 11160 

atcatacaca tacgatttag gtgacactat agaatacaag ctggaagatc ttccagcttg 11220 

ggctgcaggt cgactctaga gtccgttaca taacttacgg taaatggccc gcctggctga 11280 

ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat agtaacgcca 11340 

atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc ccacttggca 11400 

gtacatcaag tgtatcatat gccaagtacg ccccctattg acgtcaatga cggtaaatgg 11460 

cccgcctggc attatgccca gtacatgacc ttatgggact ttcctacttg gcagtacatc 11520 

tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacat caatgggcgt 11580 

ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt caatgggagt 11640 

ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactc cgccccattg 11700 

acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata taagcagagc tcgtttagtg 11760 

aaccgcgcca gtcttccgat agactgcgtc gcccgggtac ccgtattccc aataaagcct 11820 

cttgctgttt gcatccgaat cgtggtctcg ctgttccttg ggagggtctc ctctgagtga 11880 

ttgactaccc acgacggggg tctttcattt gggggctcgt ccgggatttg gagacccctg 11940 

cccagggacc accgacccac caccgggagg taagctggcc agcaacttat ctgtgtctgt 12000 

ccgattgtct agtgtctatg tttgatgtta tgcgcctgcg tctgtactag ttagctaact 12060 

agctctgtat ctggcggacc cgtggtggaa ctgacgagtt ctgaacaccc ggccgcaacc 12120

ctgggagacg tcccagggac tttgggggcc gtttttgtgg cccgacctga ggaagggagt 12180 

cgatgtggaa tccgaccccg tcaggatatg tggttctggt aggagacgag aacctaaaac 12240 

agttcccgcc tccgtctgaa tttttgcttt cggtttggaa ccgaagccgc gcgtcttgtc 12300 

tgctgcagcg ctgcagcatc gttctgtgtt gtctctgtct gactgtgttt ctgtatttgt 12360 

ctgaaaatta gggccagact gttaccactc ccttaagttt gaccttaggt cactggaaag 12420 

atgtcgagcg gatcgctcac aaccagtcgg tagatgtcaa gaagagacgt tgg 12473 



<210> 56 
<211> 9729 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 56 

gttaccttct gctctgcaga atggccaacc tttaacgtcg gatggccgcg agacggcacc 60 
tttaaccgag acctcatcac ccaggttaag atcaaggtct tttcacctgg cccgcatgga 120 
cacccagacc aggtccccta catcgtgacc tgggaagcct tggcttttga cccccctccc 180

tgggtcaagc cctttgtaca ccctaagcct ccgcctcctc ttcctccatc cgccccgtct 240 

ctcccccttg aacctcctcg ttcgaccccg cctcgatcct ccctttatcc agccctcact 300 

ccttctctag gcgccggaat tcgttaactc gagaggcctg ccaccatggg gactgctcca 360 

aagaagaagc gtaaggtagt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc 420

caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag cgaagaggcc 480 

cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg ctttgcctgg 540 

tttccggcac cagaagcggt gccggaaagc tggctggagt gcgatcttcc tgaggccgat 600 

actgtcgtcg tcccctcaaa ctggcagatg cacggttacg atgcgcccat ctacaccaac 660 

gtaacctatc ccattacggt caatccgccg tttgttccca cggagaatcc gacgggttgt 720

tactcgctca catttaatgt tgatgaaagc tggctacagg aaggccagac gcgaattatt 780 

tttgatggcg ttaactcggc gtttcatctg tggtgcaacg ggcgctgggt cggttacggc 840

caggacagtc gtttgccgtc tgaatttgac ctgagcgcat ttttacgcgc cggagaaaac 900 

cgcctcgcgg tgatggtgct gcgttggagt gacggcagtt atctggaaga tcaggatatg 960 

tggcggatga gcggcatttt ccgtgacgtc tcgttgctgc ataaaccgac tacacaaatc 1020

agcgatttcc atgttgccac tcgctttaat gatgatttca gccgcgctgt actggaggct 1080 

gaagttcaga tgtgcggcga gttgcgtgac tacctacggg taacagtttc tttatggcag 1140 

ggtgaaacgc aggtcgccag cggcaccgcg cctttcggcg gtgaaattat cgatgagcgt 1200 

ggtggttatg ccgatcgcgt cacactacgt ctgaacgtcg aaaacccgaa actgtggagc 1260 

gccgaaatcc cgaatctcta tcgtgcggtg gttgaactgc acaccgccga cggcacgctg 1320 

attgaagcag aagcctgcga tgtcggtttc cgcgaggtgc ggattgaaaa tggtctgctg 1380 

ctgctgaacg gcaagccgtt gctgattcga ggcgttaacc gtcacgagca tcatcctctg 1440 

catggtcagg tcatggatga gcagacgatg gtgcaggata tcctgctgat gaagcagaac 1500 

aactttaacg ccgtgcgctg ttcgcattat ccgaaccatc cgctgtggta cacgctgtgc 1560 

gaccgctacg gcctgtatgt ggtggatgaa gccaatattg aaacccacgg catggtgcca 1620

atgaatcgtc tgaccgatga tccgcgctgg ctaccggcga tgagcgaacg cgtaacgcga 1680 

atggtgcagc gcgatcgtaa tcacccgagt gtgatcatct ggtcgctggg gaatgaatca 1740

ggccacggcg ctaatcacga cgcgctgtat cgctggatca aatctgtcga tccttcccgc 1800 

ccggtgcagt atgaaggcgg cggagccgac accacggcca ccgatattat ttgcccgatg 1860 

tacgcgcgcg tggatgaaga ccagcccttc ccggctgtgc cgaaatggtc catcaaaaaa 1920

tggctttcgc tacctggaga gacgcgcccg ctgatccttt gcgaatacgc ccacgcgatg 1980 

ggtaacagtc ttggcggttt cgctaaatac tggcaggcgt ttcgtcagta tccccgttta 2040

cagggcggct tcgtctggga ctgggtggat cagtcgctga ttaaatatga tgaaaacggc 2100 

aacccgtggt cggcttacgg cggtgatttt ggcgatacgc cgaacgatcg ccagttctgt 2160 

atgaacggtc tggtctttgc cgaccgcacg ccgcatccag cgctgacgga agcaaaacac 2220 

cagcagcagt ttttccagtt ccgtttatcc gggcaaacca tcgaagtgac cagcgaatac 2280 

ctgttccgtc atagcgataa cgagctcctg cactggatgg tggcgctgga tggtaagccg 2340 

ctggcaagcg gtgaagtgcc tctggatgtc gctccacaag gtaaacagtt gattgaactg 2400 

cctgaactac cgcagccgga gagcgccggg caactctggc tcacagtacg cgtagtgcaa 2460 

ccgaacgcga ccgcatggtc agaagccggg cacatcagcg cctggcagca gtggcgtctg 2520 

gcggaaaacc tcagtgtgac gctccccgcc gcgtcccacg ccatcccgca tctgaccacc 2580 

agcgaaatgg atttttgcat cgagctgggt aataagcgtt ggcaatttaa ccgccagtca 2640 

ggctttcttt cacagatgtg gattggcgat aaaaaacaac tgctgacgcc gctgcgcgat 2700 

cagttcaccc gtgcaccgct ggataacgac attggcgtaa gtgaagcgac ccgcattgac 2760 

cctaacgcct gggtcgaacg ctggaaggcg gcgggccatt accaggccga agcagcgttg 2820

ttgcagtgca cggcagatac acttgctgat gcggtgctga ttacgaccgc tcacgcgtgg 2880 

cagcatcagg ggaaaacctt atttatcagc cggaaaacct accggattga tggtagtggt 2940

caaatggcga ttaccgttga tgttgaagtg gcgagcgata caccgcatcc ggcgcggatt 3000

ggcctgaact gccagctggc gcaggtagca gagcgggtaa actggctcgg attagggccg 3060

caagaaaact atcccgaccg ccttactgcc gcctgttttg accgctggga tctgccattg 3120

tcagacatgt ataccccgta cgtcttcccg agcgaaaacg gtctgcgctg cgggacgcgc 3180

gaattgaatt atggcccaca ccagtggcgc ggcgacttcc agttcaacat cagccgctac 3240 

agtcaacagc aactgatgga aaccagccat cgccatctgc tgcacgcgga agaaggcaca 3300

tggctgaata tcgacggttt ccatatgggg attggtggcg acgactcctg gagcccgtca 3360 

gtatcggcgg aattccagct gagcgccggt cgctaccatt accagttggt ctggtgtcaa 3420

aaataataat aaccgggcag gggggatccg cagatccggc tgtggaatgt gtgtcagtta 3480

gggtgtggaa agtccccagg ctccccagca ggcagaagta tgcaaagcat gcctgcagga 3540

gtggggaggc acgatggccg ctttggtcga ggcggatccg gccattagcc atattattca 3600 
ttggttatat agcataaatc aatattggct attggccatt gcatacgttg tatccatatc 3660

ataatatgta catttatatt ggctcatgtc caacattacc gccatgttga cattgattat 3720 

tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 3780 

tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 3840

cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 3900 

gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 3960

tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 4020 

agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 4080 

ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 4140

ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 4200 

aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 4260 

atgtacggtg ggaggtctat ataagcagag ctcgtttagt gaaccgtcag atcgcctgga 4320 

gacgccatcc acgctgtttt gacctccata gaagacaccg ggaccgatcc agcctccgcg 4380 

gccccaagct tgttgggatc caccggtcgc caccatggtg agcaagggcg aggagctgtt 4440

caccggggtg gtgcccatcc tggtcgagct ggacggcgac gtaaacggcc acaagttcag 4500

cgtgtccggc gagggcgagg gcgatgccac ctacggcaag ctgaccctga agttcatctg 4560 

caccaccggc aagctgcccg tgccctggcc caccctcgtg accaccctga cctacggcgt 4620 

gcagtgcttc agccgctacc ccgaccacat gaagcagcac gacttcttca agtccgccat 4680 

gcccgaaggc tacgtccagg agcgcaccat cttcttcaag gacgacggca actacaagac 4740 

ccgcgccgag gtgaagttcg agggcgacac cctggtgaac cgcatcgagc tgaagggcat 4800

cgacttcaag gaggacggca acatcctggg gcacaagctg gagtacaact acaacagcca 4860 

caacgtctat atcatggccg acaagcagaa gaacggcatc aaggtgaact tcaagatccg 4920

ccacaacatc gaggacggca gcgtgcagct cgccgaccac taccagcaga acacccccat 4980 

cggcgacggc cccgtgctgc tgcccgacaa ccactacctg agcacccagt ccgccctgag 5040 

caaagacccc aacgagaagc gcgatcacat ggtcctgctg gagttcgtga ccgccgccgg 5100 

gatcactctc ggcatggacg agctgtacaa gtaaagcggc cgcgactcta gatcataatc 5160 

agccatacca catttgtaga ggttttactt gctttaaaaa acctcccaca cctccccctg 5220 

aacctgaaac ataaaatgaa tgcaattgtt gttgttaaca tcgataaaat aaaagatttt 5280 

atttagtctc cagaaaaagg ggggaatgaa agaccccacc tgtaggtttg gcaagctagc 5340 

ataacttcgt ataatgtatg ctatacgaag ttattctaga gaaccatcag atgtttccag 5400 

ggtgccccaa ggacctgaaa tgaccctgtg ccttatttga actaaccaat cagttcgctt 5460 

ctcgcttctg ttcgcgcgct tctgctcccc gagctcaata aaagagccca caacccctca 5520 

ctcggggcgc cagtcctccg attgactgag tcgcccgggt acccgtgtat ccaataaacc 5580 

ctcttgcagt tgcatccgac ttgtggtctc gctgttcctt gggagggtct cctctgagtg 5640

attgactacc cgtcagcggg ggtctttcat ttgggggctc gtccgggatc gggagacccc 5700 

tgcccaggga ccaccgaccc accaccggga ggtaagctgg ctgcctcgcg cgtttcggtg 5760 

atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag 5820 

cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg 5880 

gcgcagccat gacccagtca cgtagcgata gcggagtgta tactggctta actatgcggc 5940 

atcagagcag attgtactga gagtgcacca tatgcggtgt gaaataccgc acagatgcgt 6000 

aaggagaaaa taccgcatca ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 6060 

ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 6120 

agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 6180 

ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 6240 

caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 6300 

gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 6360 

cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta 6420 

tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 6480 

gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 6540 

cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 6600 

tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 6660 

tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 6720

caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 6780 

aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 6840 

cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 6900 

ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 6960 

tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 7020 

atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 7080 
tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 7140

aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 7200

catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 7260

gcgcaacgtt gttgccattg ctgcaggcat cgtggtgtca cgctcgtcgt ttggtatggc 7320

ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 7380

aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 7440 

atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 7500 

cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 7560 

gagttgctct tgcccggcgt caacacggga taataccgcg ccacatagca gaactttaaa 7620 

agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 7680 

gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 7740 

caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 7800 

ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 7860 

tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 7920

aggggttccg cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat 7980 

catgacatta acctataaaa ataggcgtat cacgaggccc tttcgtcttc aagaattcat 8040 

accagatcac cgaaaactgt cctccaaatg tgtccccctc acactcccaa attcgcgggc 8100 

ttctgcctct tagaccactc taccctattc cccacactca ccggagccaa agccgcggcc 8160 

cttccgtttc tttgcttttg aaagacccca cccgtaggtg gcaagctagc gatgaccctg 8220 

ctgattggtt cgctgaccat ttccggggtg cggaacggcg ttaccagaaa ctcagaaggt 8280

tcgtccaacc aaaccgactc tgacggcagt ttacgagaga gatgataggg tctgcttcag 8340 

taagccagat gctacacaat taggcttgta catattgtcg ttagaacgcg gctacaatta 8400 

atacataacc ttatgtatca tacacatacg atttaggtga cactatagaa tacaagctgg 8460

aagatcttcc agcttgggct gcaggtcgac tctagagtcc gttacataac ttacggtaaa 8520

tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt 8580 

tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggagt atttacggta 8640

aactgcccac ttggcagtac atcaagtgta tcatatgcca agtacgcccc ctattgacgt 8700

caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttat gggactttcc 8760 

tacttggcag tacatctacg tattagtcat cgctattacc atggtgatgc ggttttggca 8820 

gtacatcaat gggcgtggat agcggtttga ctcacgggga tttccaagtc tccaccccat 8880 

tgacgtcaat gggagtttgt tttggcacca aaatcaacgg gactttccaa aatgtcgtaa 8940

caactccgcc ccattgacgc aaatgggcgg taggcgtgta cggtgggagg tctatataag 9000 

cagagctcgt ttagtgaacc gcgccagtct tccgatagac tgcgtcgccc gggtacccgt 9060 

attcccaata aagcctcttg ctgtttgcat ccgaatcgtg gtctcgctgt tccttgggag 9120 

ggtctcctct gagtgattga ctacccacga cgggggtctt tcatttgggg gctcgtccgg 9180 

gatttggaga cccctgccca gggaccaccg acccaccacc gggaggtaag ctggccagca 9240 

acttatctgt gtctgtccga ttgtctagtg tctatgtttg atgttatgcg cctgcgtctg 9300 

tactagttag ctaactagct ctgtatctgg cggacccgtg gtggaactga cgagttctga 9360 

acacccggcc gcaaccctgg gagacgtccc agggactttg ggggccgttt ttgtggcccg 9420

acctgaggaa gggagtcgat gtggaatccg accccgtcag gatatgtggt tctggtagga 9480

gacgagaacc taaaacagtt cccgcctccg tctgaatttt tgctttcggt ttggaaccga 9540 

agccgcgcgt cttgtctgct gcagcgctgc agcatcgttc tgtgttgtct ctgtctgact 9600 

gtgtttctgt atttgtctga aaattagggc cagactgtta ccactccctt aagtttgacc 9660 

ttaggtcact ggaaagatgt cgagcggatc gctcacaacc agtcggtaga tgtcaagaag 9720

agacgttgg 9729 



<210> 57 
<211> 7591 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 57 

gttaccttct gctctgcaga atggccaacc tttaacgtcg gatggccgcg agacggcacc 60 
tttaaccgag acctcatcac ccaggttaag atcaaggtct tttcacctgg cccgcatgga 120

cacccagacc aggtccccta catcgtgacc tgggaagcct tggcttttga cccccctccc 180 

tgggtcaagc cctttgtaca ccctaagcct ccgcctcctc ttcctccatc cgccccgtct 240 

ctcccccttg aacctcctcg ttcgaccccg cctcgatcct ccctttatcc agccctcact 300 

ccttctctag gcgccggaat tcgttaactc gaggatccac cggtcgccac catggtgagc 360

aagggcgagg agctgttcac cggggtggtg cccatcctgg tcgagctgga cggcgacgta 420

aacggccaca agttcagcgt gtccggcgag ggcgagggcg atgccaccta cggcaagctg 480

accctgaagt tcatctgcac caccggcaag ctgcccgtgc cctggcccac cctcgtgacc 540 

accctgacct acggcgtgca gtgcttcagc cgctaccccg accacatgaa gcagcacgac 600 

ttcttcaagt ccgccatgcc cgaaggctac gtccaggagc gcaccatctt cttcaaggac 660 

gacggcaact acaagacccg cgccgaggtg aagttcgagg gcgacaccct ggtgaaccgc 720 

atcgagctga agggcatcga cttcaaggag gacggcaaca tcctggggca caagctggag 780 

tacaactaca acagccacaa cgtctatatc atggccgaca agcagaagaa cggcatcaag 840 

gtgaacttca agatccgcca caacatcgag gacggcagcg tgcagctcgc cgaccactac 900 

cagcagaaca cccccatcgg cgacggcccc gtgctgctgc ccgacaacca ctacctgagc 960 

acccagtccg ccctgagcaa agaccccaac gagaagcgcg atcacatggt cctgctggag 1020 

ttcgtgaccg ccgccgggat cactctcggc atggacgagc tgtacaagta aagcggccct 1080 

aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1140 

gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1200 

aaccccccac ctggcgacag gtgcctctgc ggccaaaagc caccgagttg gttcagctgc 1260 

tgcctgaggc tggacgacct cgcggagttc taccggcagt gcaaatccgt cggcatccag 1320 

gaaaccagca gcggctatcc gcgcatccat gcccccgaac tgcaggagtg gggaggcacg 1380 

atggccgctt tggtcgaggc ggatccggcc attagccata ttattcattg gttatatagc 1440 

ataaatcaat attggctatt ggccattgca tacgttgtat ccatatcata atatgtacat 1500 

ttatattggc tcatgtccaa cattaccgcc atgttgacat tgattattga ctagttatta 1560 

atagtaatca attacggggt cattagttca tagcccatat atggagttcc gcgttacata 1620 

acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat tgacgtcaat 1680 

aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc aatgggtgga 1740 

gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc caagtacgcc 1800 

ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt acatgacctt 1860 

atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta ccatggtgat 1920 

gcggttttgg cagtacatca atgggcgtgg atagcggttt gactcacggg gatttccaag 1980 

tctccacccc attgacgtca atgggagttt gttttggcac caaaatcaac gggactttcc 2040 

aaaatgtcgt aacaactccg ccccattgac gcaaatgggc ggtaggcatg tacggtggga 2100 

ggtctatata agcagagctc gtttagtgaa ccgtcagatc gcctggagac gccatccacg 2160 

ctgttttgac ctccatagaa gacaccggga ccgatccagc ctccgcggcc ccaagcttac 2220 

catgggggca ggtgccaccg gccgcgccat ggacgggccg cgcctgctgc tgttgctgct 2280 

tctgggggtg tcccttggag gtgccaagga ggcatgcccc acaggcctgt acacacacag 2340 

cggtgagtgc tgcaaagcct gcaacctggg cgagggtgtg gcccagcctt gtggagccaa 2400 

ccagaccgtg tgtgagccct gcctggacag cgtgacgttc tccgacgtgg tgagcgcgac 2460 

cgagccgtgc aagccgtgca ccgagtgcgt ggggctccag agcatgtcgg cgccgtgcgt 2520 

ggaggccgac gacgccgtgt gccgctgcgc ctacggctac taccaggatg agacgactgg 2580 

gcgctgcgag gcgtgccgcg tgtgcgaggc gggctcgggc ctcgtgttct cctgccagga 2640 

caagcagaac accgtgtgcg aggagtgccc cgacggcacg tattccgacg aggccaacca 2700 

cgtggacccg tgcctgccct gcaccgtgtg cgaggacacc gagcgccagc tccgcgagtg 2760 

cacacgctgg gccgacgccg agtgcgagga gatccctggc cgttggatta cacggtccac 2820 

acccccagag ggctcggaca gcacagcccc cagcacccag gagcctgagg cacctccaga 2880 

acaagacctc atagccagca cggtggcagg tgtggtgacc acagtgatgg gcagctccca 2940 

gcccgtggtg acccgaggca ccaccgacaa cctcatccct gtctattgct ccatcctggc 3000 

tgctgtggtt gtgggccttg tggcctacat agccttcaag aggtggaaca gctgctgagt 3060 

cgactctaga ggatccccaa catcgataaa ataaaagatt ttatttagtc tccagaaaaa 3120 

ggggggaatg aaagacccca cctgtaggtt tggcaagcta gcttaagtaa cgccattttg 3180 

caaggcatgg aaaaatacat aactgagaat agagaagttc agatcaaggt caggaacaga 3240 

tggaacagct gaatatgggc caaacaggat atctgtggta agcagttcct gccccggctc 3300 

agggccaaga acagatggaa cagctgaata tgggccaaac aggatatctg tggtaagcag 3360 

ttcctgcccc ggctcagggc caagaacaga tggtccccag atgcggtcca gccctcagca 3420 

gtttctagag aaccatcaga tgtttccagg gtgccccaag gacctgaaat gaccctgtgc 3480 

cttatttgaa ctaaccaatc agttcgcttc tcgcttctgt tcgcgcgctt ctgctccccg 3540 
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agctcaataa aagagcccac aacccctcac tcggggcgcc agtcctccga ttgactgagt 3600 

cgcccgggta cccgtgtatc caataaaccc tcttgcagtt gcatccgact tgtggtctcg 3660 

ctgttccttg ggagggtctc ctctgagtga ttgactaccc gtcagcgggg gtctttcatt 3720 

tgggggctcg tccgggatcg ggagacccct gcccagggac caccgaccca ccaccgggag 3780 

gtaagctggc tgcctcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct 3840 

cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg 3900 

cgcgtcagcg ggtgttggcg ggtgtcgggg cgcagccatg acccagtcac gtagcgatag 3960 

cggagtgtat actggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat 4020 

atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcgctcttcc 4080 

gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 4140 

cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 4200 

tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 4260 

cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 4320 

aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 4380 

cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 4440 

gcgctttctc atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 4500 

ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 4560 

cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 4620 

aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 4680 

tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc 4740 

ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt 4800 

tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 4860 

ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 4920 

agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca 4980 

atctaaagta tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca 5040 

cctatctcag cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag 5100 

ataactacga tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac 5160 

ccacgctcac cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc 5220 

agaagtggtc ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct 5280 

agagtaagta gttcgccagt taatagtttg cgcaacgttg ttgccattgc tgcaggcatc 5340 

gtggtgtcac gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg 5400 

cgagttacat gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc 5460 

gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat 5520 

tctcttactg tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag 5580 

tcattctgag aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat 5640 

aataccgcgc cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg 5700 

cgaaaactct caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca 5760 

cccaactgat cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga 5820 

aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc 5880 

ttcctttttc aatattattg aagcatttat cagggttatt gtctcatgag cggatacata 5940 

tttgaatgta tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg 6000 

ccacctgacg tctaagaaac cattattatc atgacattaa cctataaaaa taggcgtatc 6060 

acgaggccct ttcgtctcgc gcgtttcggt gatgacggtg aaaacctctg acacatgcag 6120 

ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca agcccgtcag 6180 

ggcgcgtcag cgggtgttgg cgggtgtcgg ggctggctta actatgcggc atcagagcag 6240 

attgtactga gagtgcacca tatggacata ttgtcgttag aacgcggcta caattaatac 6300 

ataaccttat gtatcataca catacgattt aggtgacact atagaactcg actctagagt 6360 

ccgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 6420 

tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 6480 

aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 6540 

caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 6600 

acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 6660 

ccatggtgat gcggttttgg cagtacatca atgggcgtgg atagcggttt gactcacggg 6720 

gatttccaag tctccacccc attgacgtca atgggagttt gttttggcac caaaatcaac 6780 

gggactttcc aaaatgtcgt aacaactccg ccccattgac gcaaatgggc ggtaggcgtg 6840 

tacggtggga ggtctatata agcagagctc gtttagtgaa ccgcgccagt cttccgatag 6900 

actgcgtcgc ccgggtaccc gtattcccaa taaagcctct tgctgtttgc atccgaatcg 6960 

tggtctcgct gttccttggg agggtctcct ctgagtgatt gactacccac gacgggggtc 7020 

tttcatttgg gggctcgtcc gggatttgga gacccctgcc cagggaccac cgacccacca 7080 

ccgggaggta agctggccag caacttatct gtgtctgtcc gattgtctag tgtctatgtt 7140 

tgatgttatg cgcctgcgtc tgtactagtt agctaactag ctctgtatct ggcggacccg 7200 

tggtggaact gacgagttct gaacacccgg ccgcaaccct gggagacgtc ccagggactt 7260 

tgggggccgt ttttgtggcc cgacctgagg aagggagtcg atgtggaatc cgaccccgtc 7320 

aggatatgtg gttctggtag gagacgagaa cctaaaacag ttcccgcctc cgtctgaatt 7380 

tttgctttcg gtttggaacc gaagccgcgc gtcttgtctg ctgcagcgct gcagcatcgt 7440 

tctgtgttgt ctctgtctga ctgtgtttct gtatttgtct gaaaattagg gccagactgt 7500 

taccactccc ttaagtttga ccttaggtca ctggaaagat gtcgagcgga tcgctcacaa 7560 

ccagtcggta gatgtcaaga agagacgttg g 7591 



<210> 58 
<211> 2870 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 



<400> 58 

gctagcataa cttcgtataa tgtatgctat acgaagttat tctagagaac catcagatgt 60 

ttccagggtg ccccaaggac ctgaaatgac cctgtgcctt atttgaacta accaatcagt 120 

tcgcttctcg cttctgttcg cgcgcttctg ctccccgagc tcaataaaag agcccacaac 180 

ccctcactcg gggcgccagt cctccgattg actgagtcgc ccgggtaccc gtgtatccaa 240 

taaaccctct tgcagttgca tccgacttgt ggtctcgctg ttccttggga gggtctcctc 300 

tgagtgattg actacccgtc agcgggggtc tttcatttgg gggctcgtcc gggatcggga 360 

gacccctgcc cagggaccac cgacccacca ccgggaggta agctggctgc ctcgcgcgtt 420 

tcggtgatga cggtgaaaac ctctgacaca tgcagctccc ggagacggtc acagcttgtc 480 

tgtaagcgga tgccgggagc agacaagccc gtcagggcgc gtcagcgggt gttggcgggt 540 

gtcggggcgc agccatgacc cagtcacgta gcgatagcgg agtgtatact ggcttaacta 600 

tgcggcatca gagcagattg tactgagagt gcaccatatg cggtgtgaaa taccgcacag 660 

atgcgtaagg agaaaatacc gcatcaggcg ctcttccgct tcctcgctca ctgactcgct 720 

gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt 780 

atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 840 

caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga 900 

gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata 960 

ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac 1020 

cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg 1080 

taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc 1140 

cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag 1200 

acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt 1260 

aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta gaaggacagt 1320 

atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg 1380 

atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac 1440 

gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca 1500 

gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac 1560 

ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac 1620 

ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt 1680 

tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt 1740 

accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt 1800 

atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc 1860 

cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa 1920 

tagtttgcgc aacgttgttg ccattgctgc aggcatcgtg gtgtcacgct cgtcgtttgg 1980 

tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt 2040 

gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc 2100 

agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt 2160 

aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg 2220 

gcgaccgagt tgctcttgcc cggcgtcaac acgggataat accgcgccac atagcagaac 2280 

tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc 2340 

gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt 2400 

tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg 2460 

aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat attattgaag 2520 

catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa 2580 

acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacgtct aagaaaccat 2640 

tattatcatg acattaacct ataaaaatag gcgtatcacg aggccctttc gtcttcaaga 2700 

attcatacca gatcaccgaa aactgtcctc caaatgtgtc cccctcacac tcccaaattc 2760 

gcgggcttct gcctcttaga ccactctacc ctattcccca cactcaccgg agccaaagcc 2820 

gcggcccttc cgtttctttg cttttgaaag accccacccg taggtggcaa 2870 



<210> 59 
<211> 3097 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 59 

gctagcttaa gtaacgccat tttgcaaggc atggaaaaat acataactga gaatagagaa 60 

gttcagatca aggtcaggaa cagatggaac agctgaatat gggccaaaca ggatatctgt 120 

ggtaagcagt tcctgccccg gctcagggcc aagaacagat ggaacagctg aatatgggcc 180 

aaacaggata tctgtggtaa gcagttcctg ccccggctca gggccaagaa cagatggtcc 240 

ccagatgcgg tccagccctc agcagtttct agagaaccat cagatgtttc cagggtgccc 300 

caaggacctg aaatgaccct gtgccttatt tgaactaacc aatcagttcg cttctcgctt 360 

ctgttcgcgc gcttctgctc cccgagctca ataaaagagc ccacaacccc tcactcgggg 420 

cgccagtcct ccgattgact gagtcgcccg ggtacccgtg tatccaataa accctcttgc 480 

agttgcatcc gacttgtggt ctcgctgttc cttgggaggg tctcctctga gtgattgact 540 

acccgtcagc gggggtcttt catttggggg ctcgtccggg atcgggagac ccctgcccag 600 

ggaccaccga cccaccaccg ggaggtaagc tggctgcctc gcgcgtttcg gtgatgacgg 660 

tgaaaacctc tgacacatgc agctcccgga gacggtcaca gcttgtctgt aagcggatgc 720 

cgggagcaga caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc ggggcgcagc 780 

catgacccag tcacgtagcg atagcggagt gtatactggc ttaactatgc ggcatcagag 840 

cagattgtac tgagagtgca ccatatgcgg tgtgaaatac cgcacagatg cgtaaggaga 900 

aaataccgca tcaggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 960 

cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 1020 

ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 1080 

aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 1140 

cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 1200 

cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 1260 

gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 1320 

tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 1380 

cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg 1440 

ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 1500 

gagttcttga agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc 1560 

gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 1620 

accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 1680 

ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 1740 

tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 1800 

aattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt 1860 

taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata 1920 

gttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc atctggcccc 1980 

agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac 2040 

cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag 2100 

tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac 2160 

gttgttgcca ttgctgcagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc 2220 

agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg 2280 

gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc 2340 

atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct 2400 

gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc 2460 

tcttgcccgg cgtcaacacg ggataatacc gcgccacata gcagaacttt aaaagtgctc 2520 

atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc 2580 

agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc 2640 

gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca 2700 

cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt 2760 

tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt 2820 

ccgcgcacat ttccccgaaa agtgccacct gacgtctaag aaaccattat tatcatgaca 2880 

ttaacctata aaaataggcg tatcacgagg ccctttcgtc ttcaagaatt cataccagat 2940 

caccgaaaac tgtcctccaa atgtgtcccc ctcacactcc caaattcgcg ggcttctgcc 3000 

tcttagacca ctctacccta ttccccacac tcaccggagc caaagccgcg gcccttccgt 3060 

ttctttgctt ttgaaagacc ccacccgtag gtggcaa 3097 



<210> 60 

<211> 42 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 60 

ggctagagaa ttccaggtaa gatgggcgat cccctcacct gg 42 



<210> 61 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 61 

ttgggtactc ctcgctaggt tc 22 



<210> 62 
<211> 8 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 62 
caggtaag 8 



<210> 63 
<211> 512 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 63 

cgcccgaaca gggacctgag aggggcgcag accctacctg ttgaacctgg ctgatcgtag 60 

gatccccggg acagcagagg agaacttaca gaagtcttct ggaggtgttc ctggccagaa 120 

cacaggagga caggtaagat gggagaccct ttgacatgga gcaaggcgct caagaagtta 180 

gagaaggtga cggtacaagg gtctcagaaa ttaactactg gtaactgtaa ttgggcgcta 240 

agtctagtag acttatttca tgataccaac tttgtaaaag aaaaggactg gcagctgagg 300 

gatgtcattc cattgctgga agatgtaact cagacgctgt caggacaaga aagagaggcc 360 

tttgaaagaa catggtgggc aatttctgct gtaaagatgg gcctccagat taataatgta 420 

gtagatggaa aggcatcatt ccagctccta agagcgaaat atgaaaagaa gactgctaat 480 

aaaaagcagt ctgagccctc tgaagaatat ct 512 



<210> 64 
<211> 514 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 64 

cgcccgaaca gggacctgag aggggcgcag accctacctg ttgaacctgg ctgatcgtag 60 

gatccccggg acagcagagg agaacttaca gaagtcttct ggaggtgttc ctggccagaa 120 

cacaggagga caggtaagat tgggagaccc tttgacattg gagcaaggcg ctcaagaagt 180 

tagagaaggt gacggtacaa gggtctcaga aattaactac tggtaactgt aattgggcgc 240 

taagtctagt agacttattt catgatacca actttgtaaa agaaaaggac tggcagctga 300 

gggatgtcat tccattgctg gaagatgtaa ctcagacgct gtcaggacaa gaaagagagg 360 

cctttgaaag aacatggtgg gcaatttctg ctgtaaagat gggcctccag attaataatg 420 

tagtagatgg aaaggcatca ttccagctcc taagagcgaa atatgaaaag aagactgcta 480 

ataaaaagca gtctgagccc tctgaagaat atct 514 



<210> 65 
<211> 522 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 65 

cgcccgaaca gggacctgag aggggcgcag accctacctg ttgaacctgg ctgatcgtag 60 

gatccccggg acagcagagg agaacttaca gaagtcttct ggaggtgttc ctggccagaa 120 

cacaggagga caggtaagat tgggagaccc tttgacattg gagcaaggcg ctcaagaagt 180 

tagagaaggt gacggtacaa gggtctcaga aattaactac tggtaactgt aattgggcgc 240 

taagtctagt agacttattt cattgatacc aactttgtaa aagaaaagga ctggcagctg 300 

agggattgtc attccattgc tggaagattg taactcagac gctgtcagga caagaaagag 360 

aggcctttga aagaacattg gtgggcaatt tctgctgtaa agattgggcc tccagattaa 420 

taattgtagt agattggaaa ggcatcattc cagctcctaa gagcgaaata ttgaaaagaa 480 

gactgctaat aaaaagcagt ctgagccctc tgaagaatat ct 522 
